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Abstract 

Hard  Constraint  Satisfaction  Problems  (HCSPs)  are  Constraint  Sadsfaction  Problems  (CSPs) 
with  very  large  search  spaces  and  very  few  solutions.  Real-life  problems  such  as  design  or 
factory  scheduling  are  examples  of  HCSPs.  These  problems  typically  involve  several  hundred 
(or  even  several  thousand)  variables,  each  with  up  to  several  hundred  possible  values,  only  a 
very  tiny  fraction  of  which  ultimately  allows  for  a  satisfying  solution.  This  pj^r  addresses  the 
issue  of  how  to  generate  advice  to  decide  which  variable  to  instantiate  next  (i.e.  variable 
ordering  heuristics),  and  which  value  to  assign  to  that  variable  (i.e.  value  ordering  heuristics)  in 
order  to  reduce  search  for  a  solution.  Our  investigation  is  conducted  in  the  domain  of  job  shop 
scheduling.  It  is  shown  that,  in  this  domain,  generic  CSP  heuristics  are  usually  not  sufficient  to 
guide  the  search  for  a  feasible  solution.  This  is  because  these  heuristics  fail  to  properly  account 
for  the  tightness  of  constraints  and/or  the  coimectivity  of  the  constraint  graph.  Instead,  a 
probabilistic  model  of  the  search  space  is  used  to  define  new  heuristics,  which  better  account  for 
these  problem  characteristics.  Experimental  results  indicate  that  these  new  heuristics  yield 
important  improvements  in  both  search  efficiency  and  search  time. 
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1  Introduction 

Hard  Constraint  Satisfaction  Problems  (HCSPs)  arc  Constraint  Satisfaction  Problems  (CSPs) 
with  very  large  search  spaces  and  very  few  solutions.  Real-life  problems  such  as  design  [26]  or 
factory  scheduling  [9, 38]  are  examples  of  HCSPs.  These  problems  typically  involve  hundreds  of 
variables,  each  with  up  to  several  hundred  possible  values,  only  a  very  tiny  fraction  of  which 
ultimately  allows  for  a  satisfying  solution.  This  p^r  addresses  the  issue  of  how  to  generate 
advice  to  decide  which  variable  to  instantiate  next  (i.e.  variable  ordering  heuristics),  and  which 
value  to  assign  to  that  variable  (i.e.  value  ordering  heuristics)  in  order  to  reduce  search  for  a 
solution.  Our  investigation  is  conduaed  in  the  domain  of  job  shop  scheduling. 

More  specifically,  we  study  a  variation  of  the  job  shop  scheduling  problem,  referred  to  as  the  job 
shop  CSP,  in  which  operations  have  to  be  performed  within  non-rclaxable  time  windows. 
Examples  of  such  problems  include  faaory  scheduling  problems,  in  which  some  operations  have 
to  be  performed  within  one  or  several  shifts,  spacecraft  mission  scheduling  problems,  in  which 
time  windows  arc  determined  by  astronomical  events  over  which  we  have  no  control,  factory 
rescheduling  problems,  in  which  a  small  set  of  operations  need  to  be  rescheduled  without 
revising  the  schedule  of  other  operations,  etc.  The  objective  assumed  in  this  study  requires 
finding  a  feasible  schedule  as  fast  as  possible.  An  adaptation  of  the  techniques  presented  in  this 
paper  to  a  Constrained  Optimization  version  of  the  problem  can  be  found  in  [38, 39]. 

The  job  shop  CSP  is  a  well-known  NP-complete  problem  [12].  Accordingly,  the  worst-case 
complexity  of  any  procedure  to  solve  this  problem  is  expeaed  to  be  exponential.  CSP  techniques 
that  interleave  search  with  consistency  enforcing  techniques  and  variable/value  ordering 
heuristics  have  been  reported  to  yield  in^rtant  increases  in  search  efficiency  when  applied  to 
other  CSPs  [15, 11,  32,  20, 6, 25, 46, 7, 10].  One  of  the  aims  of  this  study  is  to  determine  if 
similar  savings  can  be  obtained  in  the  case  of  the  job  shop  CSP,  and,  more  generally,  if,  on  the 
average,  the  CSP  paradigm  is  sufficient  to  efficiently  solve  HCSPs  like  job  shop  scheduling.  In 
order  to  address  this  difficult  question,  we  first  review  generic  variable  and  value  ordering 
heuristics  that  have  been  reported  to  perform  particularly  well  on  other  CSPs.  The  review 
suggests  that  these  heuristics  are  often  too  weak  to  solve  HCSPs  like  job  shop  scheduling.  This 
is  because  these  heuristics  fail  to  properly  account  for  the  tighmess  of  constraints  and/or  for  the 
interactions  induced  by  the  high  coimectivity  of  the  constraint  graphs  often  encountered  in  job 
shop  scheduling  problems^.  The  second  part  of  this  paper  introduces  a  probabilistic  framework, 
within  which  new  variable  and  value  ordering  heuristics  arc  defined  that  better  account  for  these 
interactions.  Our  study  suggests  that  a  key  to  defining  these  more  powerful  heuristics  lies  in  the 
ability  of  the  probabilistic  framework  to  provide  estimates  of  the  reliance  of  a  variable  on  the 
availability  of  one  of  its  remaining  values  (e.g.,  in  job  shop  scheduling,  the  reliance  of  an 
operation  on  the  availability  of  a  reservation),  and  measures  of  contention  between  variables  for 
the  allocation  of  incompatible  values  (e.g.,  in  job  shop  scheduling,  measures  of  resource 


^Constraint  graphs  are  gr^hical  representations  of  binary  CSPs  (i.e.  CSPs  with  binary  constraints)  in  which  eadi 
variable  is  represented  by  a  node,  and  binary  constraints  are  represented  by  arcs  between  two  nodes. 
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contention  between  unscheduled  operations). 

E^^iimental  results  indicate  that  these  new  heuristics  outperform  both  generic  CSP  heuristics  as 
well  as  more  specialized  heuristics  recently  developed  for  similar  job  shop  CSPs.  This  work  also 
shows  that,  despite  its  exponential  worst-case  complexity,  the  job  shop  CSP  admits  many 
instances  that  can  be  solved  efficiently.  There  remain  however  some  particularly  difficult 
problems  that  require  larger  amounts  of  search. 

Last  but  not  least,  this  smdy  strongly  suggests  that  benchmark  problems  often  used  in  the  CSP 
literature  are  not  representative  of  HCSPs  like  job  shop  scheduling.  It  is  hoped  that  this  work  will 
prompt  researchers  in  the  field  to  look  for  new  benchmark  problems  and  new  more  powerful 
heuristics  for  these  problems. 

Section  2  of  this  ptper  provides  a  formal  definition  of  the  job  shop  scheduling  CSP.  Section  3 
details  the  backtrack  search  procedure  used  in  our  smdy.  Shortcomings  of  popular  variable  and 
value  ordering  heuristics  are  respectively  discussed  in  Sections  4  and  5.  Section  6  describes  new 
variable  and  value  ordering  heuristics  based  on  a  probabilistic  model  of  the  search  space.  The 
complexity  of  these  heuristics  as  well  as  that  of  the  overall  tqrproach  are  discussed  in  Section  7. 
Experimental  results  comparing  our  new  heuristics  with  other  heuristics  discussed  in  this  paper 
are  presented  in  Section  8.  Section  9  summarizes  the  paper,  and  further  discusses  the 
implications  of  our  smdy. 

Earlier  variations  of  the  techruques  presented  in  this  paper  are  discussed  in 
[33, 34, 35, 36, 10, 37]. 
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2  The  Job  Shop  Constraint  Satisfaction  Problem 

The  job  shop  CSP  requires  scheduling  a  set  of  jobs  /=  [j^ , ...  }  on  a  set  of  physical  resources 

RES={Ri  ,--,R„ } .  Each  job  consists  of  a  set  of  operations  ]  to  be  scheduled 

according  to  a  process  routing  that  specifies  a  partial  ordering  among  these  operations  (e.g.  0‘- 
BEFORE  Oj).  This  study  assumes  job  shop  CSPs  with  tree-like  process  routings.  A  tree-like 
process  routing  is  one  whose  graph  of  precedence  constraints  forms  a  tree^.  Two  examples  of 
tree-like  process  routings  are  depicted  in  Figure  1. 


Figure  1:  Examples  of  tree-like  process  routings. 


In  the  job  shop  CSP  studied  in  this  paper,  each  job  jj  has  a  release  date  rdj  and  a  due-date  ddi 
between  which  all  its  operations  have  to  be  performed.  Each  operation  0\  has  a  fixed  duration 
du'  and  a  variable  start  time  st\.  The  domain  of  possible  start  times  of  each  operation  is  initially 
constrained  by  the  release  and  due  dates  of  the  job  to  which  the  operation  belongs.  If  necessary, 
the  model  allows  for  additional  unary  constraints  that  further  restrict  the  set  of  admissible  start 
times  of  each  operation,  thereby  defining  one  or  several  time  windows  within  which  an  operation 


has  to  be  carried  out  (e.g.  a  specific  shift  in  factory  scheduling).  In  order  to  be  successfully 
executed,  each  operation  0\  requires  p\  different  resources  (e.g.  a  milling  machine  and  a 


machinist)  (I  pj), 
which  to  choose, 

machines). 


for  each  of  which  there  may  be  a  pool  of  physical  resources  from 
/},  with  RES  (1  ^k^q\p  (e.g.  several  possible  milling 

Wij 


More  formally,  the  problem  can  be  defined  as  follows: 


^This  is  by  far  the  most  common  situation,  especially  in  faaoiy  scheduling.  Extensions  of  the  techniques 
presented  in  this  paper  to  more  general  types  of  process  routings  will  be  briefly  discussed  as  well. 
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VARIABLES: 

The  variables  of  the  problem  are: 

1.  the  operation  start  times,  1  <  i  <  rij),  and 

2.  the  resources,  (1  <  / < n,  1  ^  i  < n^,  1  <;  seleaed  for  those  resource 
requirements  for  which  an  operation  has  several  alternatives. 

In  our  search  procedure,  each  operation  is  considered  an  aggregate  variable  (or  vector)  consisting 
of  the  start  time  of  the  operation  and  each  one  of  its  resource  requirements. 

CONSTRAINTS: 

The  non-unary  constraints  of  the  problem  are  of  two  types: 

1.  Precedence  constraints  defined  by  the  process  routings  translate  into  linear 
inequalities  of  the  type:  st[+du‘-  <  stj  (i.e.  O’  BEFORE  Oj); 


2.  Capacity  constraints  that  restrict  the  use  of  each  resource  to  only  one  operation  at 
a  time  translate  into  disjunctive  constraints  of  the  form: 

(Vp  V  ^  St'.  V  stj+dUj  ^  s^.  These  constraints  simply  express 

that,  unless  they  use  different  resources,  two  operations  0*  and  Oj  cannot  overlsqj^. 
Additionally,  there  are  unary  constraints  restricting  the  set  of  possible  values  of  individual 
variables.  These  constraints  include  non-relaxable  due  dates  and  release  dates,  between  which  all 
operations  in  a  job  need  to  be  performed.  The  model  actually  allows  any  type  of  unary  constraint 
that  further  restricts  the  set  of  possible  start  times  of  an  operation.  Time  is  assumed  discrete,  i.e. 
operation  start  times  and  end  times  can  only  take  integer  values.  Finally,  each  resource 
requirement  Ri  has  to  be  selected  from  a  set  of  resource  alternatives,  c  RES, 


OBJECTIVE: 

In  the  job  shop  CSP  studied  in  this  ptq>er,  the  objective  is  to  come  up  with  a  feasible  solution  as 
fast  as  possible.  Notice  that  this  objective  is  different  from  simply  minimizing  the  number  of 
search  states  visited.  It  also  accounts  for  the  time  spent  by  the  system  deciding  which  search  state 
to  explore  next. 

EXAMPLE: 

Figure  2  depicts  a  simple  job  shop  scheduling  problem  with  four  jobs  J=  {jj  J2  J4}  and  four 
physical  resources  RES  =  { Rj  ,^2  ^  *his  example,  each  operation  has  a  single  resource 

requirement  with  a  single  possible  value.  Operation  start  times  are  the  only  variables.  For  the 


^Tbese  constraints  have  to  be  generalized  when  dealing  widi  resources  of  capacity  larger  than  one. 
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Figure  2:  A  simple  job  shop  problem  with  4  jobs.  Each  node 
is  labeled  by  the  operation  that  it  represents  and  the 
resource  required  by  this  operation. 

sake  of  simplicity,  it  is  assumed  that  all  operations  have  the  same  duration,  namely  3  time  units, 
that  all  jobs  are  released  at  time  0  and  have  to  be  completed  by  time  15  (the  minimum  makespan 
of  this  problem^).  None  of  these  simplifying  assumptions  is  required  by  the  techniques  that  will 
be  discussed;  jobs  usually  have  different  release  and  due  dates,  operations  can  have  different 
durations,  se^  eral  resource  requirements,  and  several  alternatives  for  each  of  these  requirements. 
However  simple,  this  example  will  often  turn  out  to  be  sufficient  to  highlight  the  shortcomings 
of  some  popular  CSP  heuristics.  If  necessary,  the  example  will  be  slightly  complicated  in  order 
to  emphasize  other  shortcomings  that  would  not  be  immediately  visible  otherwise. 

Notice  that,  in  this  problem,  resource  /?2  is  the  only  one  to  be  required  by  four  operations  (one 
from  each  job).  Since  all  op)erations  in  the  example  have  the  same  duration,  resource  i?2  ^ 


^Tbe  makespan  of  a  schedule  is  the  length  of  the  time  interval  that  spans  from  the  earliest  operation  start  time  to 
the  latest  operation  end  time  [2]. 


expected  to  be  a  small  bottleneck^. 


^Infoimally,  a  bottleneck  is  a  resource  or  group  of  resources  whose  utilization  is  expected  to  be  close  to  or  larger 
than  its  available  capacity. 
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3  The  Search  Procedure 

A  general  paradigm  for  solving  CSPs  relies  on  the  use  of  depth-first  backtrack  search 
[43, 14, 3, 30].  Within  this  context,  variables  or  groups  of  variables  (Le.  subproblems)  are 
successively  instantiated  (i.e.  assigned  a  value).  Each  time  a  new  variable  (or  group  of  variables) 
is  instantiated,  a  new  search  state  is  created  that  corresponds  to  a  new,  more  complete,  partial 
solution.  This  process  goes  on  imtil  either  a  complete  solution  is  obtained  or  until  a  so-called 
deadend  state  is  reached  that  cannot  be  completed  without  violating  one  or  several  problem 
constraints.  In  the  latter  case,  the  system  needs  to  undo  one  or  several  assignments  and  try 
alternative  ones,  if  there  are  any  left  (otherwise  the  problem  is  infeasible).  This  process  of 
undoing  earlier  assignments  is  known  as  backtracking.  It  results  in  lower  search  efficiency,  and, 
hence,  is  undesirable. 

In  the  worst  case,  for  NP-complete  CSPs  such  as  the  job  shop  scheduling  CSP,  exponential 
amoimts  of  backtracking  may  be  necessary  to  come  up  with  a  feasible  solution  (schedule).  In 
practice,  as  demonstrated  by  the  experimental  study  presented  in  this  p^r,  it  is  generally 
possible  to  maintain  the  average  complexity  of  the  procedure  at  a  very  low  level.  This  is 
achieved  by  interleaving  search  with  the  ^jplication  of  consistency  enforcing  techniques  and 
variableA'alue  ordering  heuristics: 

•  Consistency  Enforcing  (Checking)  Techniques:  These  techniques  are  meant  to  prune 
the  search  space  by  eliminating  local  inconsistencies  that  cannot  participate  in  a 
global  solution  [20].  This  is  done  by  inferring  new  constraints  and  adding  them  to 
the  current  problem  formulation.  If,  during  this  process,  the  domain  of  a  variable 
becomes  empty,  a  deadend  situation  has  been  identified. 

•  VariablelValue  Ordering  Heuristics:  These  heuristics  are  concerned  with  the  order 
in  which  variables  are  instantiated  and  values  assigned  to  each  variable.  As 
discussed  in  the  remainder  of  this  paper,  these  heuristics  can  have  a  great  impact  on 
search  efficiency. 

Concretely,  for  job  shop  scheduling  CSPs,  the  search  procedure  starts  in  a  search  state  in  which 
no  operation  has  been  scheduled  yet,  and  proceeds  by  scheduling  one  operation  (i.e.  aggregate 
variable)  at  a  time: 

1.  If  all  operations  have  been  scheduled  then  stop,  else  go  on  to  2; 

2.  Apply  the  consistency  enforcing  procedure; 

3.  If  a  deadend  is  detected  then  backtrack  (ie.  select  an  alternative  if  there  is  one  left 
and  go  back  to  1,  else  stop  and  report  that  the  problem  is  infeasible),  else  go  on  to 
step  4; 
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4.  Select  the  next  operation  to  be  scheduled  (so-called  variable  ordering  heuristic); 


5.  Select  a  promising  reservation  for  that  operation  (so-called  value  ordering 
heuristic) 


6.  Create  a  new  search  state  by  adding  the  new  reservation  assignment  to  the  ctirrent 
partial  schedule.  Go  back  to  1. 

The  results  reported  in  this  study  were  obtained  using  a  simple  chronological  backtracking 
scheme. 

Qearly  there  is  a  tradeoff  between  the  time  spent  enforcing  consistency  in  each  search  state  and 
the  actual  savings  achieved  in  search  dme.  As  with  other  CSPs,  on  the  average,  it  is  not  a  good 
idea  to  seek  very  high  levels  of  consistency  in  a  search  state  [22, 15, 28, 20, 6, 7,  25] .  However, 
while  for  many  CSPs  sinqjly  achieving  arc-consistency  [44, 19]  in  a  forward  checking 
[15, 25]  fashion  appears  to  be  optimal,  job  shop  scheduling  generally  entails  slightly  higher 
levels  of  consistency  with  respect  to  precedence  constraints^.  Indeed,  it  is  possible  to  achieve 
complete  arc  consistency  with  respect  to  precedence  constraints  in  0(a)  time,  where  a  is  the 
number  of  precedence  constraints  in  the  problem  [41].  As  in  PERT/CPM  [16],  this  is  done  using 
a  longest  path  algorithm  that  takes  advantage  of  the  acyclicity  of  the  precedence  grq)h  to 
produce  an  efficient  order  in  which  to  update  pairs  of  earliest/latest  possible  start  times  for  each 
unscheduled  operation^.  It  turns  out  that  this  method  actually  guarantees  decomposability 
^  [5, 8].  Hence,  in  the  absence  of  capacity  constraints  (e.g.  problems  in  which  no  two  operations 
require  the  same  resource),  updating  pairs  of  earliest/latest  possible  start  times  for  each 
unscheduled  operation  in  each  search  state  is  sufficient  to  guarantee  backtrack-fiee  search. 

Enforcing  consistency  with  respect  to  capacity  constraints  spears  to  be  more  difficult  due  to  the 
disjunctive  nature  of  these  constraints.  For  these  constraints,  a  forward  checking  type  cf 
consistency  enforcement  is  carried  out  with  respect  to  edacity  constraints  [18].  In  other  words. 


binar’  constraint  (Le.  "arc")  restricting  two  variables  is  said  to  be  arc  consistent  when  the  sets  of  remaining 
possible  Vi.  .^s  of  both  variables  are  such  that  any  value  in  die  set  of  one  variable  is  su^qxnted  by/compatible  with  at 
least  one  value  in  die  set  of  the  odier.  Achieving  arc  consistency  widi  respect  to  a  binary  constraint  requires  pruning 
all  values  that  do  not  meet  this  condition.  In  general,  for  two  variables  with  k  possible  values  eadi,  this  requires  at 
most  OilP’)  consistency  checks.  A  search  state  is  said  to  be  totaUy  arc-consistent  if  all  its  constraints  have  been 
made  arc  consistent.  Forward  checking  is  a  form  of  partial  arc-consistency  [21, 15].  It  only  requires  achieving 
arc-consistency  with  re^ct  to  binary  constraints  connecting  non-instantii^  variables  to  instantiated  ones. 
Forward  cbed^g  does  not  attempt  to  achieve  arc-consistency  between  non-instantiated  variables. 

^See  also  [40]  for  an  incremental  version  of  this  procedure,  as  new  operations  are  scheduled. 

‘a  constraint  network  is  said  to  be  decomposable  iff  every  assignment  of  values  to  any  subset  of  K  variables  that 
satisfies  all  the  constraints  among  these  K  variables  can  be  extended  by  an  assignment  of  a  value  to  any  variable  not 
in  the  subset,  in  such  a  way  that  the  resulting  set  of  1  assignments  satisfies  all  the  constraints  among  die  1 
variables  [8].  Decomposability  is  sufficient  to  ensure  backtrack-fiee  search. 
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whenever  a  resource  is  allocated  to  an  operation  over  some  time  interval,  this  procedure  marks 
that  time  interval  as  unavailable  to  all  other  operations  requiring  that  same  resource. 
Additionally,  in  order  to  rapidly  catch  some  capacity  constraint  violations,  it  was  found  useful  to 
add  a  set  of  redundant  ctq>acity  constraints  to  the  problem  formulation.  Short  of  enforcing  full 
arc-consistency,  these  constraints,  which  are  described  in  Appendix  A,  efficiently  enforce  a 
higher  level  of  arc-consistency  than  simple  forward  checking. 

Because, it  is  only  possible  to  efficiently  enforce  partial  consistency  with  respect  to  ct^acity 
constraints,  backtracking  will  sometimes  occur.  In  other  words,  the  scheduling  procedure  will 
sometimes  reach  a  search  state,  in  which  several  unscheduled  operations  competing  for  a 
resource  appear  to  each  have  some  possible  reservations  left,  while  the  total  ctq)acity  available 
on  the  resource  is  actually  insufficient  to  accommodate  all  these  operations  together.  Notice, 
however,  that  because  consistency  enforcement  with  respect  to  precedence  constraints  is 
sufficient  to  guarantee  decomposability  (with  respect  to  these  coar-traints),  backtracking  can  only 
occur  as  the  result  of  capacity  constraint  violations. 

Because  it  is  impossible  to  efficiently  guarantee  backtrack-free  search  for  job  shop  CSPs, 
variable  and  value  ordering  heuristics  are  generally  critical  in  determining  the  actual  complexity 
of  the  search  procedure.  The  next  two  sections  examine  popular  variable  and  value  ordering 
heuristics  developed  for  generic  CSPs  as  well  as  more  specialized  heuristics  and  study  their 
applicability  to  HCSPs  such  as  job  shop  scheduling. 
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4  Shortcomings  of  Popular  Variable  Ordering  Heuristics 

A  powerful  way  to  reduce  the  average  complexity  of  backtrack  search  consists  in  judiciously 
selecting  the  order  in  which  variables  are  instantiated.  The  intuition  is  diat,  by  instantiating 
difficult  variables  first,  backtrack  search  will  generally  avoid  building  partial  solutions  that  it 
will  not  be  able  to  complete  later  on.  This  reduces  the  chances  (i.e.  the  frequency)  of 
backtracking.  Instantiating  difficult  variables  first  can  also  help  reduce  the  amount  of 
backtracking  when  the  system  is  in  a  deadend  state  that  is  not  immediately  detected  by  its 
consistency  checking  mechanism.  Indeed,  by  instantiating  difficult  variables,  the  system  moves 
to  more  constrained  deadend  states  that  are  easier  to  detect  This  reduces  the  time  the  system 
wastes  attempting  to  complete  partial  solutions  that  cannot  be  completed. 

Two  types  of  variable  ordering  heuristics  are  usually  distinguished: 

1.  Fixed  variable  ordering  heuristics:  A  unique  variable  ordering  is  determined 
prior  to  starting  the  search  and  used  in  each  branch  of  the  search  tree; 


2.  Dynamic  variable  ordering  heuristics:  The  ordering  is  dynamically  revised  in 
each  search  state  in  order  to  account  for  earlier  assignments.  Different  branches  in 
the  search  tree  generally  entail  different  variable  orderings. 

Qearly  fixed  variable  orderings  require  less  computation  since  they  are  determined  once  and  for 
all.  On  the  other  hand,  dynamic  variable  ordering  heuristics  are  potentially  more  powerful 
because  of  their  ability  to  identify  difficult  variables  within  specific  search  states  rather  than  for 
the  overall  search  tree.  Many  CSP  studies  performed  on  simple  problems  such  as  N-queens  or  on 
moderate-size  problems  have  found  that  dynamic  variable  ordering  hexuistics  are  too  exp>ensive 
(e.g.  [7]).  There  are  however  more  difficult  problems,  for  which  dynamic  variable  ordering 
heuristics  can  be  expected  to  achieve  exponential  savings  in  the  average  amount  of  search 
required  to  come  up  with  a  solution  [32].  For  these  more  difficult  problems,  it  has  often  been 
suggested  that  a  simple  heuristic  known  as  the  Dynamic  Search  Rearrangement  heuristic 
(DSR)  would  generally  be  sufficient  [3, 32, 7, 13].  In  each  search  state,  DSR  looks  for  the 
variable  with  the  smallest  number  of  remaining  values,  and  selects  this  variable  to  be  instantiated 
next.  DSR  has  often  been  used  as  a  benchmark  to  det  mine  whether  it  is  worthwhile  using  a 
dynamic  variable  ordering  heuristic  for  a  given  class  of  problems.  The  experiments  presented  at 
the  end  of  this  paper  clearly  show  that  job  shop  scheduling  belongs  to  the  class  of  more  difficult 
problems  for  which  a  dynamic  variable  ordering  is  justified.  Furthermore  these  experiments 
show  that  even  DSR  is  often  insufficient  to  solve  realistic  job  shop  CSPs. 

The  scheduling  problem  introduced  in  the  previous  section  helps  rmderstand  the  shoncomings  of 
DSR.  Figure  3  dqricts  the  problem  after  the  tpplication  of  the  consistency  enforcing  procedure 
described  in  section  3. 

According  to  DSR,  there  are  six  operations  that  are  equally  good  candidates  to  be  assigned  a 
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Figure  3:  The  same  job  shop  CSP  after  consistency  labeling. 

Start  time  labels  are  represented  as  intervals.  For 
instance,  [0,6]  represents  all  start  times  between  time  0 
and  time  6,  as  allowed  by  the  time  granularity,  namely 
{0,1,2,3,4,5,6}. 

reservation  first:  O,,  Oj,  O^,  O,,  O2,  and  Oy  Indeed,  these  six  operations  all  appear  equally 
difficult  to  DSR,  as  they  each  have  seven  possible  start  times  left.  The  other  four  operations  in 
the  problem  appear  easier  as  they  each  have  ten  possible  start  times  left.  It  is  easy  to  see  that 
some  of  the  six  operations  that  tqipear  equally  difficult  to  DSR  are  in  faa  more  difficult  to 
schedule  than  others.  Consider  operations  0\  and  Oy  both  operations  require  resource  R2,  which 
is  required  by  a  total  of  four  operations.  Moreover,  in  three  cases  out  of  four,  the  operation 
requiring  resource  R2  is  the  last  operation  in  its  job.  This  high  contention  for  resource  R2 
indicates  that  O2  and  will  probably  be  difficult  to  schedule.  On  the  other  hand,  an  operation 
like  0\  confutes  only  with  one  other  operation  for  resource  R^,  namely  operation  Oy  Moreover, 
the  fact  that  Oj  is  the  first  operation  in  job  while  O3  is  the  last  operation  in  job  73,  suggests 
that  these  two  operations  are  not  very  likely  to  compete  with  each  other.  Operations  O]  and  0\ 
can  be  expected  to  be  easier  to  schedule  than  operations  O2  and  0\.  Unfortunately  DSR  is  not 
able  to  account  for  these  observations.  This  is  because  DSR  singly  counts  the  number  of 
remaining  values  of  each  variable,  but  fails  to  estimate  the  likelihood  that  these  values  remain 
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available  later  on.  Qearly  start  tunes  of  operations  competing  for  highly  contended  resomces  are 
more  likely  to  become  unavailable  than  those  of  other  operations. 

In  this  example,  the  bottleneck  resource  R2  also  corresponds  to  the  largest  clique  of  edacity 
constraints.  Therefore,  a  variable  ordering  heuristic  that  identifies  difficult  variables  (i.e.  nodes 
in  the  constraint  graph)  as  those  with  many  incident  constraints  might  actually  provide  better 
advice  than  DSR.  Several  such  variable  ordering  heuristics  have  been  proposed  in  the  literature. 
These  heuristics  are  generally  fixed  variable  ordering  heuristics,  unless  new  constraints  are 
added  to  the  problem  as  it  is  solved.  One  such  heuristic  is  the  Minimum  Width  (MW)  heuristic 
[11, 7].  MW  "orders  the  variables  from  last  to  first  by  selecting,  at  each  stage,  a  node  in  the 
constraint  graph  which  has  a  minimal  degre^  in  the  graph  remaining  after  deleting  from  the 
graph  all  nodes  which  have  been  selected  already”  [7].  A  variation  of  this  heuristic  known  as  the 
Minimum  Degree  (MD)  heuristic  simply  ranks  variables  according  to  their  degree  in  the  initial 
constraint  graph  [7].  In  the  example  depicted  in  Figure  2,  MD  would  select  to  be  scheduled 
first.  There  are  also  MW  orderings  starting  with  that  operation.  In  general,  scheduling  problems 
are  not  that  simple,  and  fixed  variable  ordering  heuristics  like  MD  or  MW  do  not  provide  very 
good  advice  either.  This  is  best  illustrated  by  slightly  modifying  the  scheduling  problem 
depicted  in  Figure  2. 

Suppose,  for  instance,  that  we  change  the  problem  and  introduce  a  fifth  resource,  say  R^. 
Suppose  also  that  we  allow  any  of  the  operations  reqtiiring  or  R3  in  the  original  problem  to 
use  R5  as  an  alternative  resource.  We  now  have: 

.Qi,=Q;,=£4,={fi,,R5) 

.nj,=£2|,=  (R3,R5) 

The  two  cliques  of  ctqjacity  constraints  corresponding  to  and  R3  are  now  subsumed  by  a 
larger  clique  of  ctqiacity  constraints^®  involving  five  operations:  0\,  d\,  d\,  and  (Figure 

4).  Due  to  the  additional  capacity  constraints  resulting  from  the  introduction  of  R^,  there  are 
now  MW  orderings  and  MC  orderings  starting  with  some  of  these  five  operations.  In  fact  the 
addition  of  R^  has  significantly  loosened  the  capacity  constraints  participating  in  the  new  clique, 
and  the  operations  connected  by  these  constraints  are  even  easier  to  schedule  than  before.  Failure 
of  MW  and  MC  to  identify  that  these  operations  are  actually  easy  to  schedule  is  due  to  the 
inability  of  these  heuristics  to  account  for  constraint  tightness,  namely  the  difficulty  of 


^Tbe  degree  of  a  node  is  die  number  of  constraints  incident  to  that  node. 

‘^^Ciqiacity  constraints  between  operations  belon^g  to  the  same  job  are  subsumed  by  precedence  constraints  in 

33  3  3 

that  job.  For  instance,  a  cqiadty  constraint  between  Oj  and  O2,  which  would  require  that  either  O,  precede  O2  or 
3  3 

O2  precede  O  j,  if  both  operations  use  R^.  This  constraint  is  subsumed  by  the  precedence  constraint  between  the  two 

3  3 

operations,  which  requires  that  O  j  always  precede  02- 


p. 


- ! -  capacity  constraint 

P, 

^  pracedance  constraint 

Figure  4:  A  new  resource  is  added  to  the  problem. 

5  stands  for/?j  or  ^5-  ^3J5  stands  for  or  R^. 

satisfying  a  specific  constraint  [24, 10].^ ^ 

Another  problem  with  variable  ordering  heuristics  described  in  the  CSP  literature  comes  from 
the  fact  that  they  treat  all  problem  constraints  uniformly.  In  many  real-life  CSPs,  different  typ>es 
of  constraints  entail  different  levels  of  consistency  checking.  This  in  turn  determines  the  types  of 
conflicts  that  can  arise  during  search,  and  affects  the  effectiveness  of  different  variable  ordering 
heuristics.  This  is  the  case  in  job  shop  scheduling,  where,  as  explained  in  section  3,  consistency 
enforcing  techniques  take  advantage  of  particular  algebraic  properties  of  precedence  constraints 
to  efficiently  ensure  that  backtracking  only  occurs  as  a  result  of  capacity  constraint  violations. 
Consequently,  the  criticality  of  an  operation  is  uniquely  a  function  of  the  diffictilty  of  finding  a 
reservation  for  that  operation  that  will  not  violate  some  capacity  constraints.  This  observation 
can  be  exploited  to  design  more  effective  variable  ordering  heuristics. 


** Another  example  of  a  variable  ordering  heuristic  that  does  not  account  for  constraint  tightness  is  the 
Max*cardinality  seardi  order  which  arbitrarily  selects  the  first  variable  to  be  instantiated,  and  then  at  each  stage 
picks  the  variable  cormected  to  the  largest  number  of  already  instantiated  variables  [21, 7].  This  heuristic  can  also  be 
seen  as  a  fixed  variation  of  DSR. 
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A  specialized  variable  ordering  heuristic  that  takes  advantage  of  this  observation  is  the  one 
developed  by  Keng  and  Yun  [17],  though  its  authors  apparently  failed  to  relate  the  strength  of 
their  heuristic  to  this  observation.  Keng  and  Yun  suggested  a  generalization  of  DSR  in  which 
each  operation  reservation  (i.e.  each  value)  is  assigned  a  survivability  measure  reflecting  its 
chance  of  satisfying  the  ct^acity  constraints  (Le.  its  chance  of  surviving  the  competition  with 
other  operations  for  the  possession  of  a  resource).  The  operation  to  be  scheduled  next  is  the  one 
with  the  smallest  global  survivability,  as  determined  by  the  sum  of  the  survivabilities  of  each  of 
its  (remaining)  possible  reservations.  Experiments  presented  at  the  end  of  this  paper,  show  that 
this  heuristic  performs  better  than  all  the  generic  heuristics  described  above.  They  also  show  that 
this  heuristic  is  quite  expensive,  as  it  requires  inspecting  aU  the  remaining  reservations  (i.e. 
values)  of  all  tinscheduled  operations'^.  In  scheduling  problems  with  several  hundred 
operations,  each  with  several  hundred  possible  start  times  and  several  possible  resources,  this 
heuristic  may  not  be  cost  effective.  More  efficient  heuristics  can  be  obtained  by  focusing  on  one 
or  a  small  number  of  cliques  of  tight  capacity  constraints,  and  selecting  the  operation  most  likely 
to  violate  a  constraint  in  these  cliques.  A  heuristic  based  on  this  idea  is  described  in  Section  5, 
which  runs  faster  than  Keng  and  Yun’s  heuristic  while  achieving  an  even  higher  search 
efficiency. 


^^Notice  also  Aat  tbis  heuristic  may  still  identify  operations  with  just  a  few  remaining  possible  reservarioos  as 
being  critical  while  in  hia  these  reservations  may  not  be  conflicting  with  die  reservations  of  any  odier  operatioa. 
This  could  be  the  case  if,  for  instance,  operation  0  j  in  the  example  in  Hgure  2  had  only  a  small  number  of  possible 
start  times.  In  fact,  the  consistoicy  «iforcing  technique  ensures  that  backtracking  will  never  be  caused  by  this 
operation,  since  there  is  no  capacity  constraint  incident  to  it 


5  Shortcomings  of  Popular  Value  Ordering  Heuristics 

Another  powerful  way  to  reduce  the  average  complexity  of  backtrack  search  relies  on  judiciously 
selecting  the  order  in  which  value  assignments  are  tried  for  a  variable.  A  good  value  ordering 
heuristic  to  minimize  backtracking  consists  in  assigning  so-called  least  constraining  values.  A 
least  constraining  value  is  one  that  is  expected  to  participate  in  many  solutions  to  the  problem  (or 
to  the  subproblem  defined  by  the  current  search  state).  By  first  trying  least  constraining  values, 
the  system  will  generally  maximize  the  number  of  values  left  to  variables  that  still  need  to  be 
instantiated,  and  hence  it  it  will  avoid  building  partial  solutions  that  cannot  be  con^leted. 

Attempting  to  exactly  compute  the  number  of  global  solutions  in  which  a  given  assignment  (i.e. 
value)  participates  would  be  futile  as  it  would  require  finding  all  solutions  to  the  problem. 
Instead  Dechter  and  Pearl  developed  a  heuristic,  called  ABT^^,  that  relies  on  tree-like  relaxations 
of  the  problem  to  approximate  the  goodness  of  a  value.  A  tree-like  relaxation  of  a  CSP  is  one 
whose  consuaint  graph  is  a  tree  that  spans  some  or  all  the  nodes  (i.e.  variables)  of  the  original 
CSP.  Within  such  relaxations,  the  number  of  solutions  in  which  a  value  participates  can  be 
efficiently  computed  in  0{nl^)  steps,  where  n  is  the  number  of  variables  in  the  CSP,  and  k  the 
maximum  number  of  possible  values  of  a  variable.  The  intuition  is  that,  if  one  can  find  a  tree¬ 
like  relaxation  that  is  close  enough  to  the  original  CSP,  a  good  value  for  the  relaxation  should 
also  be  a  good  value  for  the  original  CSP.  One  way  to  obtain  tight  tree-like  relaxations  is  by 
associating  with  each  (binaiy)  constraint  C  in  the  original  constraint  graph  a  weight  w(C)  equal 
to  the  satisfiability  of  that  constraint  (i.e.  the  number  of  paiis  of  values  that  satisfy  the 
constraint).  A  tight  tree-like  relaxation  corresponds  to  a  Minimum  Spanning  Tree  (MST)  in  the 
resulting  network. 

While  ABT  has  performed  particularly  well  on  some  CSPs,  it  does  not  appear  appropriate  for 
CSPs  such  as  job  shop  scheduling.  Indeed,  even  for  a  fixed  variable  ordering,  the  heuristic 
generally  requires  the  corr^utation  of  a  fixed  MST  for  each  of  the  n  levels  in  the  search  tree. 
This  amounts  to  n  MST  computations,  each  of  which  typically  requires  Oin^)  elementary 
computations  [42]  (hence  a  total  of  0(n^)  elementary  computations).  The  experimental  results 
presented  at  the  end  of  this  paper  indicate  that  a  fixed  variable  ordering  is  generally  not  enough 
to  efficiently  solve  job  shop  scheduling  problems.  Under  these  conditions,  it  might  even  be 
necessary  to  identify  new  tree-like  relaxations  in  each  search  state  These  computations  may 
become  quite  expensive  for  large  CSPs.  There  is  however  a  more  important  problem  with  this 
heuristic,  whether  using  minimum  spanning  tree  relaxations  or  not;  there  is  no  guarantee  that 
there  even  exists  a  tight  enough  tree-like  relaxation  of  the  CSP,  namely  a  tree-like  relaxation  that 


i^ABT  stands  for  Advised  Backtracking. 

*^This  would  also  require  updating  the  weights  of  each  constraint  in  each  search  state. 
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will  provide  sufficiently  good  advice  to  guide  search^^.  This  is  most  likely  to  be  the  case  with 
job  shop  scheduling  problems,  as  explained  below  with  an  example. 

Consido'  constraint  Pj  in  the  scheduling  problem  depicted  in  Figure  3.  is  a  precedence 
constraint  between  operation  0\  and  operation  0\.  The  set  of  start  time  pairs  ^st^)  that  satisfy 
constraint  is: 


{ (0,3),(0,4)....,(0,9),(1 ,4).(1 ,5).....(1 .9),...,(6.9)} 


In  order  to  identify  a  tight  tree-like  relaxation,  is  assigned  a  weight,  equal  to  the 

cardinality  of  that  set,  namely  vi</’j)=7+6+5+4+3+2+l=28.  Similar  computations  can  be 
performed  to  confute  the  weights  of  other  constraints.  These  weights  are  as  follows: 

•  w(Pi)=w(P2)=w(P4)=w(P5)=28 


•  w(P3)=w(Pg)=55 


•  w(Ci)=38,  w(C2)=29,  w(C3)=38 


•  w(C4)=43 


•  w(C5)=w(C6)=38  ,  w(C7)=29  ,  w(Cg)=56  ,  w(C9)=w(Cio)=38 

Figure  5  shows  an  MST  relaxation  of  the  scheduling  problem  obtained  using  these  weights.  It 
appears  that  die  MST  relaxation  includes  10  out  of  the  16  constraints  present  in  the  original  CSP. 
The  loss  of  information  initially  contained  in  the  cliques  of  capacity  constraints  is  even  more 
dramatic.  Only  2  out  of  the  6  constraints  in  the  clique  corresp>onding  to  R2  have  been  preserved. 
This  is  not  an  accident.  In  general  a  resource  required  by  M  operations  will  result  in  a  clique  of 
capacity  constraints.  At  most  M-1  of  these  capacity  constraints  can  be  preserved  in  any 

tree-like  relaxation  of  the  problem.  Under  these  conditions,  we  should  not  be  surprised  if  the 
advice  provided  by  ABT  for  job  shop  CSPs  is  not  very  effective.  Suppose  for  instance  that  the 
system  selects  Oj  to  be  instantiated  first^^.  Using  the  MST  relaxation  represented  in  Figure  5, 
ABT  would  recommend  assigning  start  time  4  to  this  operation.  A  careful  examination  of  the 
scheduling  problem  reveals  however  that  there  is  no  feasible  schedule  with  starting  at  4. 
Indeed  it  e’poars  that  if  were  to  start  at  time  4,  the  three  other  operations  requiring  resource 


‘^Tbe  experiments  reported  in  [6]  seemed  to  indicate  the  opposite.  In  these  experiments,  it  ippeared  Aat  often  the 
advice  provided  by  ABT  was  too  expensive  and  too  accurate.  Instead  advice  provided  by  looser  relaxations  ended 
up  being  more  cost-effective.  However,  these  results  were  obtained  on  rather  small  problems  with  a  relatively  high 
density  of  solutions. 

’^t  should  now  be  clear  that  this  is  a  good  choice,  since  this  operation  has  only  seven  possible  start  times  and 
requires  resource  fbe  main  bottleneck  of  the  problem. 
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Figure  5:  An  MST  relaxation  of  the  scheduling  problem. 

/?2  would  all  have  to  be  scheduled  between  time  7  and  time  15.  In  other  words,  there  would  be  8 
time  units  left  to  fit  3  operations  that  each  have  a  duration  of  3  time  units.  This  is  clearly 
impossible. 

Keng  and  Yun  have  developed  a  specialized  value  ordering  heuristic  that  can  deal  more 
effectively  with  cliques  of  capacity  constraints  [17].  This  heuristic  &st  estimates  the  overall 
need  for  each  resource  in  function  of  time.  Based  on  these  estimates,  operation  reservations  are 
ranked  according  to  how  well  they  are  expected  to  prevent  contention  with  the  resource 
requirements  of  other  operations.  As  the  result  reported  later  in  this  p^r  indicate,  Keng  and 
Yun’s  value  ordering  heuristic  generally  outperforms  ABT.  However  their  heuristic  omits  to 
leave  enough  room  to  other  operations  within  the  same  job  so  that  they  can  be  assigned  least 
constraining  reservations  as  well.  In  other  words,  Keng  and  Yun's  heuristic  only  accounts  for  the 
capacity  constraints  incident  to  the  current  operation,  but  fails  to  account  for  capacity  constraints 
at  other  operations  connected  by  precedence  constraints  to  the  current  operation. 

The  next  section  describes  a  probabilistic  model  of  the  search  space  that  better  accounts  for  the 
high  connectivity  of  constraint  graphs  typically  found  in  job  shop  scheduling,  and  for  the 
constraint  interactions  induced  by  these  graphs.  New  variable  and  value  ordering  heuristics  are 
defined  within  this  framework  that  attempt  to  remedy  the  shortcomings  identified  above. 
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6  New  Variable  and  Value  Ordering  Heuristics 
6.1  Underlying  Assumptions 

Rigorously  speaking,  good  variable  and  value  ordering  heuristics  are  heuristics  that  minimize  the 
time  required  for  search  to  complete  (i.e.  either  with  a  solution,  if  one  exists,  or  with  the  answer 
that  the  problem  is  overconstrained).  If  the  problem  is  infeasible,  search  time  is  independent  of 
the  value  ordering  hetuistic  (except  for  the  time  spent  by  the  system  ordering  values  according  to 
the  heuristic):  once  a  variable  has  been  selected,  the  system  will  have  to  try  each  one  of  its 
remaining  values  before  being  able  to  conclude  that  the  current  partial  solution  caimot  be 
completed.  In  general  variable  and  value  ordering  heuristics  affect  the  number  of  search  states 
that  are  explored,  the  average  amount  of  time  spent  enforcing  consistency  in  each  search  state, 
and  the  amount  of  time  spent  by  the  system  ordering  variables  and  values  according  to  these 
heuristics.  Variable  and  value  ordering  heuristics  may  even  affea  each  other’s  performance. 
The  complexity  of  these  interactions  precludes  the  design  of  heuristics  that  directly  minimize  the 
expected  search  time.  One  can  however  attempt  to  design  heuristics  that  aim  at  reducing  some  of 
the  factors  identified  above.  The  approach  taken  in  this  section  aims  at  developing  heuristics  that 
efficiently  reduce  the  expected  number  of  search  states  explored  by  the  system.  Assuming  that 
the  time  spent  by  the  system  enforcing  consistency  is  mainly  a  function  of  the  number  of 
operations  that  have  already  been  scheduled  (i.e.  the  depth  in  the  search  tree)  rather  than  a 
function  of  the  specific  operations  that  have  been  scheduled,  such  heuristics  are  expeaed  to 
effectively  reduce  search  time  as  well. 

In  this  section,  it  is  postulated  that  a  critical  variable  is  one  that  is  expeaed  to  cause 
backtracking,  namely  one  whose  remaining  possible  values  are  expeaed  to  conflia  with  the 
remairung  possible  values  of  other  variables.  Under  a  set  of  simplifying  independence 
assumptions,  Haralic'  and  Elliott  have  shown  that  such  a  measure  of  criticality  will  minimize  the 
expected  length  of  branches  in  the  search  tree,  and  hence  the  total  number  of  search  states  that 
need  to  be  visited  to  come  up  with  a  solution  [15]*^.  It  is  also  postulated  that  a  good  value  is  one 
that  is  expected  to  participate  in  many  solutions. 

In  the  next  subsection,  a  probabilistic  model  of  the  search  space  is  introduced  that  can  be  used  to 
tqrproximate  variable  criticality  and  value  goodness. 


’^See  [IS]  pp.  307-312.  At  the  end  of  their  proof,  the  authors  make  the  uimecessaiy  assumption  that  each  variable 
value  is  equally  likely  to  become  unavailable.  Under  this  assumption,  the  variable  with  what  they  call  the  smallest 
success  probability  (or  equivalently  the  variable  most  likely  to  create  backtracking)  is  the  one  with  the  smallest 
number  of  remaining  values.  The  authors  erqrloit  diis  result  to  motivate  their  use  of  the  Dynamic  Search 
Rearrangement  heuristic.  When  this  last  assumption  is  omitted,  Haralick  and  Elliott’s  proof  shows  that  (urxler 
several  other  simplifying  assumptions  made  earlier  in  dieir  proof)  choosing  die  variable  most  likely  to  create 
backtracking  will  minimize  the  eiqiected  length  of  each  branch  in  the  search  tree. 
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6.2  A  Probabilistic  Model  of  the  Search  Space 

Critical  variables  are  those  expected  to  cause  backtracking.  In  other  words,  a  critical  variable  is 
one  whose  values  are  expected  to  conflict  with  the  values  of  other  variables.  In  order  to 
approximate  variable  criticality,  a  probabilistic  framework  is  described  which  accounts  for  the 
chances  that  a  given  value  will  be  assigned  to  a  variable  (also  a  measure  of  the  reliance  of  a 
variable  on  the  availability  of  this  value),  and  the  chances  that  values  assigned  to  difrerent 
variables  conflict  with  each  other  (so  called  value  contention  measures).  By  only  accounting  for 
those  conflicts  that  will  not  be  prevented  by  the  consistency  enforcing  procedure,  highly 
effective  measures  of  value  contention  can  be  obtained  for  job  shop  scheduling  CSPs.  These 
measures  are  then  used  to  determine  which  variable  (Le.  operation)  to  instantiate  next  and  which 
value  (i.e.  reservation)  to  try  first  for  that  variable. 

The  job  shop  scheduling  consistency  enforcing  procedure  described  in  section  3  ensures  that,  in 
any  given  search  state,  the  only  conflicts  that  can  still  occur  are  capacity  constraint  violations 
between  currently  vmscheduled  operations.  Accordingly,  a  critical  operation  in  this  search 
procedure,  is  one  whose  resource  requirements  are  likely  to  conflict  with  the  resource 
requirements  of  other  unscheduled  operations.  In  general,  the  chances  that  an  operation’s 
resoiu-ce  requirements  conflict  with  those  of  other  unscheduled  operations  is  determined  by  the 
number  of  operations  competing  for  the  same  reservations  and  the  reliance  of  each  operation  on 
the  availability  of  each  one  of  its  possible  reservatiorts  (i.e.  values).  Typically,  operations  with 
few  possible  reservations  left  will  heavily  rely  on  the  availability  of  any  one  of  their  remaining 
reservations,  whereas  operations  with  a  handful  of  remaining  reservations  will  rely  much  less  on 
any  one  of  these  reservations. 

Accordingly,  a  probabilistic  model  is  assumed  in  which  each  reservation  p  that  remains  possible 
for  an  unscheduled  operation  0‘-  is  assigned  a  subjective  probability  cj(p)  to  be  allocated  to  that 
operation.  Because,  a  priori,  there  is  no  reason  to  believe  that  one  reservation  is  more  likely  to 
be  selected  than  another,  each  operation  reservation  is  assigned  an  equal  probability  to  be 
selected.  Qearly,  in  any  schedule,  each  operation  will  be  assigned  only  one  reservation.  Hence, 
the  reservation  distributions  are  chosen  to  be  of  the  form: 

1 

nbr‘. 

where  NBl^i  is  the  number  of  remaining  reservations  of  O'.  in  the  current  search  state.  This 
distribution  mirrors  our  intuition  that  an  operation  with  many  possible  reservations  does  not 
heavily  rely  on  any  single  one  of  its  remaining  reservations,  and  hence  the  probability  of  any 
single  one  of  these  reservations  to  be  seleaed  is  rather  low.  On  the  other  hand,  operations  with 
few  remaining  possible  reservations  are  more  likely  to  have  to  use  any  one  of  these  reservations. 
Using  these  subjective  reservation  distributions,  it  is  possible  to  estimate  the  reliance  of  an 
operation  0\  on  the  availability  of  a  resource  RES  at  time  t  as  the  probability  that  the 
reservation  allocated  to  this  operation  will  require  that  resource  at  that  time.  This  probability 
will  be  referred  to  as  the  individual  demand  of  operation  0\  for  resource  R/^  at  time  t.  It  will  be 
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denoted  It  can  be  computed  as  the  sum  of  the  probabilities  cy^(p)  of  all  remaining 

reservations  p  of  operation  0\  that  require  resource  at  time  x.  Finally,  by  adding  the 
individual  demands  of  all  unscheduled  operations  requiring  resource  an  aggregate  demand 
profile,  is  obtained  that  indicates  contention  between  unscheduled  operations  for 

resource  Rj^zs  &  function  of  time.  Alternatively,  one  can  postulate  a  stochastic  mechanism  that 
completes  the  current  partial  schedule  (in  the  current  search  state)  by  randomly  assigning  a 
reservation  to  each  unscheduled  operation  0‘-  according  to  its  oj  distribution.  D\{Ri^,x)  is  then 
the  probability  that  the  stochastic  mechanism  assigns  0\  a  reservation  that  requires  Rj^  at  time  x, 
and  is  the  expected  number  of  reservations  made  by  the  stochastic  mechanism  for  R^^  at 

time  X  (or  the  expeaed  number  of  operations  requiring  that  resource  at  that  time). 

Similar  demand  profiles  are  built  by  Keng  and  Yun’s  variable  and  value  ordering  heuristics  [17]. 
The  heuristics  that  will  be  presented  differ  from  those  of  Keng  and  Yun  in  the  way  they  exploit 
these  demand  profiles^^.  Earlier,  Muscettola  and  Smith  also  proposed  techniques  to  build 
probabilistic  demand  profiles,  based  on  a  pred^ned  variable  ordering  [23]. 

The  following  illustrates  the  construction  of  these  profiles  for  the  example  introduced  in  Figure 

2. 

Consider  operation  0\  in  the  initial  search  state  depicted  in  Figure  3.  After  enforcing 

consistency,  this  operation  has  7  possible  reservations  (i.e.  start  times  5^=3 ,4 . 9),  each  with  a 

subjective  probability  02(54)=^  *0  ^  selected.  On  the  other  hand,  0\  has  10  possible  start  times: 
5^2= 3, 4,..,,  12.  Therefore  the  subjective  probability  that  any  one  of  these  10  possible  start 
tunes  will  be  selected  is  02(5^3)  “lo'  individual  demand  of  an  operation  (?,•  for  resoiuce  R2  at 

some  time  t  can  be  computed  by  simply  adding  the  probability  of  each  reservation  that  would 
require  using  resource  R2  at  time  x,  i.e.  by  adding  the  probabilities  of  all  reservations  starting 
between  t  and  t~du\.  For  instance: 

DJ(/?2,0=  o5(x) 

i-du2  <  t  S  r 

In  particular  D\{R2,t)=lj  for  all  times  t  such  that  3  <  t  <  4.  This  is  because  there  is  only  one 

possible  start  time  that  would  cause  operation  0\  to  use  resource  /?2  at  any  of  these  times, 
namely  st\=2).  Between  time  4  and  time  5,  things  are  different  as  there  are  two  possible  start 
times  that  would  cause  0\  to  use  /?2  over  that  time  interval:sr2=3  and  st\=A.  The  demand  of  Oj 
for  resource  R2  between  time  4  and  time  5  is  Similar  computations  can  be  performed  for  the 
other  time  intervals  over  which  O2  may  require  resource  /?2-  Figiue  6  shows  the  individual 


*®The  wo*  presented  here  was  performed  concurrently  to  that  of  Keng  and  Yun  [33, 34, 10].  Notice  that  the 
interpretation  given  by  Keng  and  Yun  for  their  demand  profiles  is  not  a  probabilistic  one. 
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D\{xy.  Individual  Demand  of  0\  for  /?2 
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D\{xy.  Individual  Demand  cf  c  2  lor  /?2 
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Figure  6:  Building  /?2’s  aggregate  demand  profile  in  the  initial  search  state. 

demands  of  all  four  operations  requiring  resource  /?2,  as  well  as  the  aggregate  demand  for  that 
resource  obtained  by  adding  the  foiu  individual  demands  over  time.  As  expected,  the  two 
operations  with  only  seven  possible  start  times  (namely  0\  and  O3)  have  more  compact 
individual  demands  than  the  two  operations  with  ten  possible  start  times  (namely  Oj  and  O^). 
Notice  also,  that,  because  of  the  normalization  of  the  oj(p)  distributions,  the  total  individual 
demand  of  an  operation  with  only  one  possible  resource  Qike  all  the  operations  in  this  example) 
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is  always  equal  to  the  duration  of  that  op)eration.  Tliis  total  demand  is  simply  spread  differently 
over  time,  depending  on  the  number  of  start  times  still  available  to  the  operation. 
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Figure  7:  Aggregate  demands  in  the  initial  search  state  for  each  of  the  four  resources. 

Figure  7  displays  the  aggregate  demands  for  the  four  resources  of  the  example.  As  anticipated, 
resource  /?2  appears  to  be  the  most  contended  for. 

In  general,  building  aggregate  demand  profiles  requires  looking  at  each  remaining  reservation  of 
each  unscheduled  operation.  Hence,  in  each  search  state,  the  worst-case  complexity  of  the 
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procedure  is  0(Nk),  where  is  the  number  of  unscheduled  operations  and  k  the  number  of 
remaining  reservations  of  an  unscheduled  operation.  In  practice,  the  sets  of  remaining 
reservations  of  many  op)erations  do  not  change  from  one  search  state  to  another  and  it  is  more 
efficient  to  only  update  the  individual  demands  of  those  operations  whose  sets  have  shrunk.  The 
old  individual  demands  of  operations  whose  .'«ets  of  possible  reservations  have  shrunk  are 
subtracted  from  the  aggregate  demand  profiles,  and  the  new  individual  demands  are  added 
instead.  It  is  possible  to  perform  similar  updates  when  the  system  backtracks. 

6.3  A  Variable  Ordering  Heuristic  Based  on  Measures  of  Resource  Contention 
The  aggregate  demand  for  a  resource  over  a  time  interval  is  a  measure  of  contention  between 
unscheduled  operations  for  that  resource/time  interval.  In  general,  the  resource/time  interval  with 
the  highest  demand  (i.e.  the  one  that  is  the  most  contended  for)  can  be  expected  to  be  the  one 
where  capacity  constraints  are  most  likely  to  be  violated  Accordingly,  the  operation  with  the 
highest  contribution  to  the  demand  for  the  most  contended-for  resourceltime  interval  is 
considered  the  most  likely  to  violate  a  capacity  constraint,  since  it  is  the  one  that  relies  most  on 
the  availability  of  that  highly  contended  interval. 

Several  variations  of  this  variable  ordering  heuristic  have  been  implemented.  The  simplest  and 
often  most  effective  one  inspects  each  resource’s  aggregate  demand  profile  using  time  intervals 
of  duration  equal  to  the  average  duration  of  the  operations  requiring  that  resource.  The  heuristic 
then  picks  the  operation  with  the  largest  contribution  (i.e.  the  largest  individual  demand)  to  the 
demand  for  the  most  contended  of  these  time  intervals.  This  is  the  variable  ordering  heuristic 
used  in  the  experiments  reported  at  the  end  of  this  p^r.  It  will  be  referred  to  as  ORR,  which 
stands  for  "Operation  Resource  Reliance"  heuristic. 

Figure  7  displays  the  demand  profiles  for  R^,  R2,  R^,  and  R^,  the  four  resources  of  the  problem 
introduced  in  Section  3.2.  The  largest  demand  peak  identified  by  ORR  is  that  for  resource  R2 
between  time  8  and  1 1 ,  which  corresponds  precisely  to  the  clique  of  tight  capacity  constraints 
identified  earlier.  Figure  8  indicates  that  the  operation  with  the  largest  contribution  to  that 
demand  is  Oy  This  is  no  coincidence:  O3  competes  for  the  most  contended  resource  and  belongs 
to  the  group  of  six  operations  that  have  only  seven  possible  start  times  left  after  consistency 
checking.  Notice  that,  in  this  example,  there  are  actually  two  intervals  in  the  demand  profile  of 
/?2  that  qualify  as  most  contended  for.  [7,10[  and  [8,1 1[.  Had  the  scheduler  chosen  [7,10[  instead 
of  [8,1 1[,  it  would  have  selected  as  the  operation  to  be  scheduled  next.  In  fact,  and 
appear  equally  critical  in  this  example. 

The  ORR  heuristic  requires  looking  successively  at  each  resource,  and  each  time  interval  in  that 
resource’s  calendar,  in  order  to  identify  the  most  contended  interval.  If  there  are  m  resources  and 
if  the  scheduling  horizon  is  H,  this  requires  0{Hm)  elementary  computations. 


’^Tbese  tight  capacity  constraints  are  those  connecting  operations  that  contribute  to  the  demand  for  the  highly 
contended  resourceAime  interval. 


Aggregate  Demand  for  R2 
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D\{xy,  Individual  Demand  of  0\  for  /?2 
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Figure  8:  ORR  Heuristic:  the  most  critical  operation  is  the  one  that  relies 
most  on  the  most  contended  resourceAime  interval. 

6.4  A  Value  Ordering  Heuristic  Avoiding  Resource  Contention 
In  Section  4,  ABT  was  found  to  have  two  major  weaknesses,  when  applied  to  job  shop 
scheduling.  The  first  weakness  had  to  do  with  the  computational  overhead  involved  in  the 
determination  of  tight  tree-like  relaxations  of  the  problem,  while  the  second  one  lay  in  the 
inability  of  tree-like  relaxations  to  properly  account  for  cliques  of  capacity  constraints.  In 
contrast,  a  value  ordering  heuristic  is  now  described  that  attempts  to  avoid  resource  contention 
while  relying  on  predetermined  tree-like  relaxations.  The  piedeteimined  tree-like  relaxations  are 
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comprised  of  some  or  all  unschedviled  operations  in  the  job  to  which  the  current  critical  operation 
(i.e.  the  operation  to  be  scheduled  next)  belongs,  along  with  all  precedence  constraints  between 
these  operations.  However,  rather  than  singly  counting  the  number  of  solutions  to  the  relaxation 
in  which  a  given  reservation  assignment  participates,  the  value  ordering  heuristic  also  accounts 
for  the  probability  that  a  solution  to  the  relaxation  satires  the  digues  of  capacity  constraints, 
thereby  making  up  for  the  typical  lack  of  information  contained  in  the  predetermined  relaxation. 
The  probability  that  a  solution  (to  the  relaxation)  satisfies  the  cliques  of  capacity  constraints  (i.e. 
"survives"  resource  contention)  is  estimated  using  the  same  demand  profiles  that  were 
constmcted  for  the  variable  ordering  heuristic. 

For  job  shop  CSPs  with  tree-like  process  routings,  the  tree-like  relaxation  adopted  by  the 
hetuistic  is  corr^rised  of  all  the  unscheduled  operations  coimected  by  precedence  constraints  to 
the  current  critical  operation,  along  with  these  precedence  constraints.  Each  candidate 
reservation  assignment  (for  the  critical  operation)  is  ranked  according  to  the  number  of  solutions 
to  the  relaxation  with  which  it  is  corrqjatible  (i.e.  the  number  of  compatible  schedules  for  the  job 
to  which  the  critical  operation  belongs)  that  are  expeaed  to  satisfy  capacity  constraints  (i.e. 
"survive"  resource  contention).  The  reservation  compatible  with  the  largest  number  of  such 
schedules  is  the  one  selected  by  the  heuristic.  The  following  describes  the  jqjproximations  used 
by  the  value  ordering  heuristic  to  compute  the  probability  that  a  reservation  "survives"  resource 
contention  and  the  probability  that  a  job  schedule  survives  resource  contention.  A  dynamic 
programming  technique  to  efficiently  coimt  the  number  of  schedules  compatible  with  a  given 
reservation  and  expeaed  to  "survive"  contention  is  presented  in  Appendix  B.  This  technique  is 
an  adaptation  of  a  procedure  developed  by  Dechter  and  Pearl  for  the  ABT  heuristic  [6]  (see  also 
[31]).  It  is  shown  that  this  technique  can  be  further  speeded  up  by  taking  advantage  of  the 
linearity  of  precedence  constraints. 

6.4.1  Estimating  the  Probability  that  a  Reservation  Survives  Contention 

Let  0\  be  an  unscheduled  operation  and  p=<st\=t ,R\^=r\^^  remaining 

reservations.  The  probability  that  assigning  reservation  p  to  operation  O'.  will  not  conflia  with 
the  resource  requirements  of  other  operations  will  be  referred  to  as  the  "survivability"  of 
reservation  p  (for  Oj).  It  will  be  denoted  5urv|(p).  The  survivability  of  assigning  reservation  p  to 
o'-  is  tqjproximated  by  the  produa  of  the  probability  that  each  one  of  the  resources  required  by 
that  reservation  will  be  available  between  t  and  t+du\  (independence  assumption): 

jwrvJ(p)=  avaitfr^.^,t,t+du)  (1) 

where  avaifi(r'.j^,t,t+dUi)  stands  for  the  probability  that  resource  will  not  be  required  by  any 
other  operation  between  t  and  t+du-  (also  the  probability  that  assigning  this  resource  to  0\  will 
not  create  backtracking). 
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Let  r\^=Rp  e  RES.  The  probability  avaitfr[^,t that  resource  r\^=Rp  will  not  be  required 
by  any  other  operation  between  t  and  r+du|  can  be  tqjproximated  using  the  aggregate  demand 
profile  of  resource  Rp  (which  is  already  maintained  by  the  system  for  the  ORR  variable  ordering 
heuristic)  and  a  vector  /Zp(T),  which  is  also  maintained  by  the  system  to  keep  track  of  the  number 
of  (unscheduled)  operations  competing  for  as  a  function  of  time^.  At  any  time 
t^T<t+du\ ,  there  are  by  definition  n  {z)- 1  unscheduled  operations  competing  with  operation 

f  ^ 

O-  for  resource  Rp.  The  total  demand  of  these  other  unscheduled  operations  for  Rp  at  time  x  is 
^J**’^('^)~f^,-(Rp.'C).  Assuming  that  each  of  these  «p(x)-l  other  operations  equally  contributes 
to  this  demand,  the  probability  that  none  of  these  operations  requires  Rp  at  time  x  is  given  by: 


D‘f\z)-D\{R  z)  P' 

(1 - f - — -  ;■— ) 


«„(x)-l 


(2) 


It  is  tempting  to  approximate  avaif.(r'.ji^,t,t+du'i),  i.e.  the  probability  that  r\jk=Rp  will  be 
available  to  0\  between  t  and  t+du'-,  as  the  produa  of  the  probabilities  that  Rp  will  be  available 
to  o'-  on  each  one  of  the  du-  time  intervals  between  t  and  t+du'-.  In  general,  this  approximation  is 
too  pessimistic.  It  assumes  that  the  operations  competing  with  have  a  duration  equal  to  1,  i.e. 
that  any  of  these  operations  could  require  Rp  over  time  interval  [x ,  x  + 1  [  without  requiring  it  over 
time  interval  [x  + 1  ,x+2[  or  over  time  interval  [x-1,  x[.  Instead,  because  operations  competing 
for  Rp  generally  require  several  contiguous  time  intervals,  a  better  approximation  consists  in 
subdividing  the  calendar  of  that  reso\irce  into  buckets  of  duration  AVG{du),  where  AVGidu)  is 
the  average  duration  of  the  operations  competing  for  r'^j^^Rp.  avait.{r^j^,t,t-¥du^  is  then 

rfff  - 

tqjproximated  as  the  probability  that  0\  will  be  able  to  secure  the  time  buckets  that  it 

requires  to  fit  on  the  resource’s  calendar.  Using  Equation  (2),  this  can  be  approximated  as: 

>1KG(«„(t)-1)x. 


/ 

du- 


AVG{d“^^' 

avail ■{Rp,t ,t-ydu^  -  (1  —  /ivG(n^(T)-i)  ^ 


AVGidu) 


(3) 


where  AVG{D“J^\z)-D'.(R  z))  and  AVGinJz)-!)  are  respectively  the  average  of 

a  I  ^  ^  ^ 

D"^^\z)-D'.{Rp,z)  and  the  average  of  ripiz)- 1  over  time  interval  [r,r+duj[. 

Figure  9  depicts  reservation  survivabilities  for  the  three  operations  in  job  j^,  the  job  to  which  O3 
belongs  (the  operation  selected  to  be  scheduled  in  the  initial  search  state).  The  shape  of  these 
survivability  curves  is  easily  interpreted  by  looking  at  Figures  3  and  7.  Consider  operation  Oi\. 


^  aggregate  demand  for  R  at  time  x  whereas  n  (x)  is  the  number  of  operations  contributing  to  that 

p  r  r 
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Figure  9:  Survivability  measures  for  the  reservations  of  operations  in  job  73,  the  job 
to  which  belongs  O3,  the  current  critical  operation. 

Figure  3  indicates  that  0\  only  competes  with  one  other  operation  for  resource  R^,  namely 
operation  O3.  Because  operation  d\  has  a  duration  du\=3  and  because  the  earliest  possible  start 
time  of  operation  O]  is  st\=6,  operation  will  never  conflict  with  operation  if  it  is 
scheduled  at  =0, 1 , 2,  or  3.  This  is  why  the  sturvivability  of  each  of  these  start  times  is  equal  to 
1 .  For  start  times  =4 , 5 ,  and  6  the  probability  of  conflicting  with  a  reservation  assigned  to 

increases,  as  indicated  in  Figtue  7  by  the  higher  aggregate  demand  for  resource  between  time 
6  and  9  (the  only  times  where  a  conflict  between  the  two  operations  is  possible).  Since  the 
probability  of  such  a  conflict  remains  fairly  low  (i.e.  the  conflicts  involve  only  two  operations 
and  only  a  small  fraction  of  the  reservations  of  these  two  operations  conflict  with  each  other),  the 
survivabilities  of  start  times  stj=4,5,  and  6  remain  fairly  close  to  1  (though  smaller  than  1). 
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Operations  O2  and  O3  compete  with  more  operations  than  O,.  Accordingly,  their  reservation 
survivabilities  are  smaller.  The  shape  of  the  survivability  curves  of  these  two  operations  can  be 
inteipieted  using  similar,  though  slightly  more  complex  arguments. 

6.4.2  Estimating  the  Probability  that  a  Job  Schedule  Survives  Contention 
A  good  reservation  is  not  only  one  that  is  likely  to  survive  resource  contention  locally.  A  good 
reservation  should  also  leave  enough  room  to  other  operations  in  the  same  job  (i.e.  same  process 
routing)  so  that  they  too  can  be  allocated  good  reservations.  Such  good  reservations  can  be 
identified  by  estimating  for  each  remaining  reservation  (of  the  operation  to  schedule)  the 
expected  number  of  compatible  job  schedules  that  are  likely  to  stirvive  resource  contention  (in 
short,  the  "expected  number  of  survivable  schedules").  When  some  operations  in  the  job  have 
already  been  scheduled,  rather  than  looking  at  the  entire  job,  it  is  sufficient  to  look  at  the 
relaxation  comprised  of  all  unscheduled  operations  that  can  be  reached  from  the  current  (critical) 
operation  via  precedence  constraints  without  visiting  a  scheduled  operation.  The  goodness  of  a 
reservation  is  then  determined  by  the  expected  number  of  survivable  solutions  to  this  relaxation. 
Indeed  unscheduled  operations  that  are  completely  separated  from  the  current  (critical)  operation 
by  already  scheduled  operations  will  not  be  affected  by  the  reservation  assigned  to  the  current 
operation. 

In  order  to  proceed,  a  few  notations  need  to  be  defined: 

•  Oj:  the  cunrent  critical  operation  (i.e.  the  operation  selected  to  be  scheduled  next); 

•  p;  one  of  0\ ’s  remaining  reservations; 


•  RELAX^-  £  oh  the  set  of  operations  making  up  the  relaxation  used  by  the  heuristic. 
This  set  consists  of  0‘-  and  the  unscheduled  opwrations  that  can  be  reached  from  0[ 
via  precedence  constraints  without  visiting  a  scheduled  operation; 

•  good^(p):  the  goodness  of  assigning  p  to  expressed  as  the  expected  number  of 
survivable  solutions  to  the  relaxation; 


•  compj(p):  the  set  of  solutions  to  the  relaxation  that  are  compatible  with  the 
assignment  of  p  to  ; 

•  sol  €  comp|(p):  a  solution  to  the  relaxation  that  is  compatible  with  the  assignment  of 

ptoOj; 


•  p(Oj|5o/):  the  reservation  assigned  to  an  operation  0[  €  RELAX',  in  solution  sol. 
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Assuming  that  the  probability  that  a  solution  sol  survives  resource  contention  can  be 
a^roximated  by  the  product  of  die  probabilities  that  each  reservation  p(p[\sol)  in  sol  survives 
contention,  the  goodness  of  assigning  p  to  0\  is: 

goodJ(p)=  Y,  n  surv[(piO[\sol))  (4) 

sol  e  compfp)  g  REUx\ 


This  independence  assumption  is  equivalent  to  omitting  the  interactions  induced  by  precedence 
constraints  in  other  jobs.  It  generally  appears  to  be  sufficient,  as  suggested  by  the  results 
presented  in  the  following  section.  Notice  that,  as  a  consequence  of  this  assumption,  the  only 
reservation  survivabilities  that  need  to  be  computed  in  each  search  state  are  those  of  operations 
in  RELA}^.  £  i.e.  a  subset  of  the  operations  in  the  job  to  which  the  critical  operation  belongs. 
Expression  (4)  can  be  rewritten  to  separate  the  survivability  of  reservation  p  from  that  of  other 
operations  in  RF.IA'X'.- 

goodJ(p)  =  survJ(p)  y-  Ys  n  surv[(p(0[\sol))  (5) 

sol  e  comp^)  o'  c  R£LOrJ\io|) 


This  can  be  further  rewritten  as: 

goo^i  (p)  =  sur\\  (p)  X  compsurv'.  (p)  (6) 

where  compsury‘.{p)  is  the  number  of  solutions  compatible  with  the  assignment  of  p  to  O'-  that  are 
expected  to  survive  contention: 

compsurv\ip)=  Y  11  5urv[{p{0[\sol)) 

sol  E  coi»v^<p)  o[  E  REUlx\\\o\) 


Note  that,  in  fact,  compsurv\{p)  is  only  a  function  of  the  start  time  st\  allocated  to  0\  in 
reservation  p  an  can  therefore  be  written  as  compsurv\{t) 

In  tree-like  process  routings,  it  is  possible  to  evaluate  compsurv'.{t)  for  all  the  possible  start 
times  r  of  in  OiVjk)  steps,  where  is  the  number  of  operations  in  relaxation  RELAX'^,  and 

k  the  maximum  number  of  possible  reservations  of  an  operation.  This  is  done  using  a  dynamic 
programming  procedure  described  in  Appendix  B.  This  technique  is  an  adaptation  of  a  procedure 
described  in  [6]^^  For  non-tree-like  process  plans,  it  should  be  possible  to  remove  a  small 

^*The  complexity  of  Decbter’s  pioceduie  is  for  general  tree-like  CSPs.  Here  we  have  further  reduced  this 

complexity  to  0(\^)  by  taking  advantage  of  the  linearity  of  precedence  constraints.  If  the  model  was  to  allow  for 
odier  temporal  constraints  such  as  those  described  in  [1],  the  complexity  of  the  algorithm  would  be 
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number  of  precedence  constraints  (e.g.  precedence  constraints  that  are  not  on  a  critical  path)  to 
transform  the  process  routing  into  a  tree-like  one,  and  use  the  resulting  relaxation  to  compute 
goodness  measures. 


Figure  10:  Value  goodness  for  O3  expressed  as  the  number  of 

con^atible  job  schedules  expected  to  survive  resource  contention. 

In  the  example  discussed  earlier,  the  critical  operation  is  0\.  Since  no  operation  has  been 
scheduled  yet,  the  relaxation  used  by  the  heuristic  consists  of  all  three  operations  in  job  ^3.  Figure 
10  displays  the  goodness  measures  conqruted  using  (6).  Start  time  5/3=6  for  instance  is  only 
compatible  with  one  solution  to  the  relaxation,  namely  a  solution  in  which  5/3=3  and  5/j=0. 
Therefore,  the  goodness  of  this  start  time  is  given  by: 
good(5/3=6)=5urv](5/]=0)x5Mrv2(5/2=3)X5urv3(5/3=6).  On  the  other  hand,  start  time  5/3=?  is 
compatible  with  three  solutions  to  the  relaxation,  one  with  5/2=3  and  5/|=0,  one  with  5/^=4  and 
5/j =0,  and  one  with  5/3=4  and  5/,  =  1.  The  survivability  of  this  start  time  was  obtained  by  adding 
the  survivabilities  of  each  of  these  three  solutions. 

Start  time  5/3=  12  is  the  one  compatible  with  the  largest  number  of  survivable  solutions  to  the 
relaxation.  Hence  this  is  the  start  time  selected  by  the  value  ordering  heuristic.  By  assigning  this 
start  time  to  O3,  and  iteratively  using  the  variable  and  value  ordering  heuristics  that  were  just 
described,  the  search  procedure  easily  completes  the  schedule  without  backtracking.  This 
problem  is  relatively  easy,  and  is  also  solved  without  backtracking  by  Keng  and  Yun’s  heuristic. 

No  heuristic  is  perfect  Although  our  value  ordering  heuristic  recommends  the  right  start  time,  a 
careful  analysis  reveals  for  instance  that  its  second  best  choice,  namely  5/3=  11,  is  actually 
infeasible.  Notice  however  that,  in  the  absence  of  backtracking,  the  scheduler  does  not  need  to 
try  the  second  best  value  recommended  by  the  heuristic:  it  is  enough  for  the  first  value  to  be 
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good. 

6.43  Further  Refinement 

For  some  reservations  p,  con^surv‘.(p)  can  become  veiy  large  and  have  too  much  influence  in 
(6)  compared  to  j«rv|(p).  Consider  the  following  two  reservations  pj  and  P2'- 
•  pj:  compsMrV;(p2)= 1000  and  surv|(pj)=03 


•  P2:  comps«rv|(p2)=200  and  jurvj(P2)=  1.0 

Ideally,  a  good  value  ordering  heuristic  should  recognize  that  reservation  P2  is  better  than 
reservation  pj,  despite  the  fact  that,  according  to  Equation  (6)  good^(Pj)=500  is  larger  than 
goodJ(P2)=200.  Indeed,  in  this  example,  it  does  not  really  matter  whether  compsurv\{p)  equals 
200  or  1000:  in  either  case  there  will  certainly  be  enough  compatible  schedules.  Instead,  the 
factor  that  really  matters  here  is  the  survivability  of  the  reservation  itself  (i.e.  locally).  In  the 
experiments  reported  at  the  end  of  this  pjqier,  this  problem  was  handled  by  filtering  the  number 
of  survivable  solutions  compatible  with  a  reservation  p,  compsurv\{p).  Instead  of  relying  on 
Equation  (6),  our  value  ordering  heuristic  measures  reservation  goodness  according  to  the 
following  revised  formula; 

goodf.{p)  =  5i<rv!(p)  X  ,compsurv‘.(p))  (7) 


where  MIN  denotes  the  minimum  function  and  <t>  is  a  parameter  of  the  system  that  is  empirically 
adjusted.  By  using  a  filter  of  the  form  the  heuristic  attempts  to  ensure  that,  on  the 

average,  each  one  of  the  v^- 1  other  operations  in  the  relaxation  has  O  survivable  reservations. 

The  resulting  heuristic  will  be  referred  to  as  FSS,  which  stands  for  "Filtered  Survivable 
Schedules"  (value  ordering)  heuristic^^. 


“A  more  sophisticated  way  to  filter  compsurv’-{p)  would  consist  in  filtering  the  number  of  compatible 
leservatioos  of  each  operation  in  the  relaxatioa  This  would  ensure  diat  each  (xie  of  the  operations  in  the  relaxation 
has  enough  compatible  reservations.  In  general,  because  die  critical  operation  is  also  the  one  in  the  relaxation  whose 
reservations  are  the  least  survivable,  a  single  filter  for  all  the  other  operations  in  the  relaxation  seems  sufficient 
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7  Overall  Complexity 

In  each  search  state,  the  worst-case  complexity  of  the  look-ahead  analysis  is  0(max(Nk,Hm)), 
where  is  the  number  of  unscheduled  operations  in  the  current  search  state,  k  the  maximum 
number  of  reservations  left  to  an  operadon  in  that  state,  H  the  scheduling  horizon,  and  m  the 
number  of  resources  in  the  system.  In  general  0(Nk)  appears  to  be  the  dominant  factor.  In  the 
absence  of  backtracking  (i.e.  the  number  of  search  states  generated  by  the  system  is  equal  to  the 
number  of  operations  to  be  scheduled),  the  overall  complexity  of  the  approach  is  0(N0P^k), 
where  NOP  denotes  the  total  number  of  operations  to  be  scheduled.  Experimentation  with 
problems  of  different  sizes  suggests  that,  in  the  absence  of  backtracking,  this  is  the  tme 
complexity  of  the  approach.  When  backtracking  occurs  the  overall  complexity  of  the  procedure 
can  be  much  higher,  though  it  is  not  often  the  case. 
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8  Empirical  Evaluation 

This  section  reports  the  results  of  an  experimental  study  comparing  the  ORR  variable  ordering 
heuristic  and  FSS  value  ordering  heuristic  against  the  DSR  (Dynamic  Search  Rearrangement) 
variable  heuristic  (DSR)  [3, 15, 32],  the  ABT  (Advised  Backtracking)  value  ordering  [6],  and  the 
combination  of  variable  and  value  ordering  heuristics  developed  by  Keng  and  Yun  [17]  (and 
referred  to  as  SMU). 

8.1  Design  of  the  Test  Data 

A  set  of  60  scheduling  problems  was  randomly  generated,  each  with  5  resources  and  10  jobs  of  5 
operations  each  (i.e.  a  total  of  50  operations  per  problem).  Each  job  had  a  linear  process  routing 
specilying  a  sequence  in  which  the  job  had  to  visit  each  one  of  the  five  resources.  This  sequence 
was  randomly  generated  for  each  job,  except  for  bottleneck  resources,  which  were  each  visited 
after  a  fixed  number  of  operations  (in  order  to  fiirther  increase  resource  contention). 

Two  parameters  were  adjusted  to  cover  different  scheduling  conditions:  a  range  parameter,  RG, 
controlled  the  distribution  of  job  due  dates  and  release  dates,  and  a  bottleneck  parameter,  BK, 
controlled  the  number  of  major  botdeneck  resources.  Six  groups  of  ten  problems  were  randomly 
generated,  by  considering  three  different  values  of  the  range  parameter  and  two  different 
bottleneck  configurations.  The  value  of  a  third  parameter,  which  will  be  referred  to  as  the  slack 
parameter,  S,  had  to  be  adjusted  in  function  of  the  first  two  in  order  to  keep  demand  for 
bottleneck  tesource(s)  close  to  100%  over  the  major  part  of  each  problem^^. 

The  three  parameters  were  set  as  follows: 

•  Range  Parameter  (RG):  this  parameter  controlled  the  release  date  and  due  date 
distributions  in  each  problem.  Due  dates  were  randomly  drawn  fi-om  a  uniform 
distribution  (l+S)Aff/(l— RG,  1),  where  U{a,b)  represents  a  uniform  probability 
distribution  between  a  and  b,  M  is  an  estimate  of  the  minimum  makespan  of  the 
problem,  and  S  is  the  slack  parameter,  which  is  defined  below  in  function  of  BK  and 
RG.  The  minimum  makespan  of  the  problem  was  estimated  as 

Msz{n- l)d u »  where  n  is  the  number  of  jobs,  m  the  number  of 
resources,  Rfjf^  the  main  bottleneck  resource  (or  one  of  them  if  there  are  several) 

and  duj^  denotes  the  average  duration  of  the  operations  requiring  resource  R^.  This 
estimate  was  first  suggested  in  [29].  Similarly,  release  dates  were  randomly  drawn 
from  a  uniform  distribution  of  the  form:  (1+S)M  (7(0, RG).  Three  values  of  the 


^If  this  parameter  bad  been  fixed  while  the  other  parameters  varied,  a  large  proportion  of  the  problems  would 
have  been  either  infeasible  or  trivial. 


range  parameter  were  used  to  generate  problems''ll  RG=0.0 ,0.1 ,  and  0.2. 

•  Bottleneck  Parameter  (BK):  In  half  of  the  problems,  there  was  only  one  major 
bottleneck  (BK=l),  while  in  die  other  half  there  were  two  major  bottlenecks 
(BK=2). 

•  Slack  Parameter  (5);  For  problems  with  2  bottlenecks  or  jobs  with  different 
release  dates  and  due  dates,  the  length  of  the  problem  was  increased  to  (1  +S)M  so 
that  most  problems  remained  feasible.  The  slack  parameter  was  empirically  set  as 
S=0.1  X  (BK—  1)+RG.  At  the  same  time,  these  values  of  the  slack  factor  generally 
maintained  demand  for  the  bottlenecks  close  to  100%  over  the  major  part  of  each 
problem. 

Finally,  operation  durations  were  randomly  drawn  from  two  different  distributions,  depending  on 
whether  the  operation  required  a  botdeneck  resource  or  not.  Bottleneck  operations  had  durations 
randomly  drawn  from  a  uniform  distribution  i/(8,16)  whereas  non-bottleneck  operations  had 
their  durations  randomly  drawn  from  a  imiform  distribution  t/(3,ll).  As  a  consequence, 
operations  in  these  problems  had  a  bit  over  100  possible  start  times  (i.e.  values)  after  applying 
the  consistency  checking  procedure  to  the  initial  search  state. 

8.2  Comparison  Against  Other  Heuristics 

Five  combinations  of  variable  and  value  ordering  heuristics  were  compared: 

•  DSR  &  ABT:  the  Dynamic  Search  Rearrangement  heiuistic  combined  with  the 
Advised  BackTracldng  [6]  value  ordering  heuristic.  The  version  of  ABT  used  in 
these  experiments  was  one  based  on  the  same  predetermined  tree-like  relaxation  as 
FSS,  namely  it  used  the  process  routing  to  which  the  current  operation  belonged. 

This  version  of  ABT  was  carefully  implemented  to  nm  in  OiVjk)  steps  in  each 
search  state  (where  is  the  number  of  operations  in  the  tree-like  relaxation  and  k 
the  maximum  number  of  remaining  start  times  of  an  operation  after  consistency 
checking).  This  was  done  using  a  procedure  similar  to  the  one  implemented  in 
FSS25. 


^Due  to  tbe  moderate  size  of  the  scheduling  problems  considered  here,  larger  values  of  RG  quickly  tend  to 
produce  less  resource  contention.  This  is  also  because  tbe  slack  parameter  S  is  increased  when  RG  l^omes  larger, 
in  order  to  keep  from  generating  infeasible  problems. 

^An  implementation  of  ABT  using  MST  relaxations  would  have  been  too  slow  to  be  competitive.  It  would  have 
required  computing  constraint  satisfiabilities  aiKl  identifying  an  MST  relaxation  in  each  search  state.  Additionally, 
tbe  time  requited  to  count  the  number  of  solutions  to  a  general  MST  relaxation  would  have  been 


36 


•  DSR  &  FSS:  the  DSR  heuristic  combined  with  the  Filtered  Survivable  Schedules 
(FSS)  value  ordering  heuristic  (with  0=2.5). 

•  ORR  &  AST:  the  Operation  Resource  Reliance  (ORR)  variable  ordering  heuristic 
together  with  the  ABT  value  ordering  heuristic. 


•  ORR  &  FSS:  The  ORR  and  FSS  heuristics  (with  0=2.5)  advocated  in  this  paper. 

•  SMU:  The  variable  and  value  ordering  heuristics  developed  by  Keng  and  Yxm  at  the 
Southern  Methodist  University  [17]. 

All  combinations  of  variable  and  value  ordering  heuristics  were  run  in  a  modular  testbed  in 
which  all  common  functions  were  shared  (e.g.  consistency  enforcing  module,  backtracking 
module,  etc),  and  uimecessaiy  functions  were  bypassed  whenever  possible  (e.g.  the  construction 
of  demand  profiles  was  bypassed  in  DSR&ABT).  All  functions  were  in^lemented  with  equal 
care. 

On  each  problem,  search  was  stopped  if  it  required  more  than  500  search  states.  The 
performance  of  each  combination  of  variable  and  value  ordering  heuristics  was  compared  along 
3  dimensions: 

1.  Search  efficiency:  the  ratio  of  the  number  of  operations  to  be  scheduled  over  the 
total  number  of  search  states  that  were  explored.  In  the  absence  of  backtracking, 
only  one  search  state  is  generated  for  each  operation,  and  hence  search  efficiency  is 
equal  to  1. 


2.  Number  of  experiments  solved  in  less  than  500  search  states  each. 


3.  Average  CPU  time  per  operation  (in  seconds):  this  is  the  average  CPU  time 
required  to  successfully  schedule  an  operation.  If  a  solution  was  found,  this  time 
was  obtained  by  dividing  the  total  CPU  time  required  to  solve  the  problem  by  the 
number  of  operations  to  be  scheduled.  Otherwise,  this  time  was  approximated  by 
dividing  the  CPU  time  spent  exploring  500  search  states  by  the  number  of 
operations.  All  CPU  times  were  obtained  on  a  DECstation  5000  running 
Knowledge  Craft  on  top  of  Allegro  Common  Lisp.  Experimentation  with  an 
earlier  version  of  the  system  written  in  C  suggests  that  the  search  procedure  should 
run  10  to  20  times  faster  in  that  language. 

The  results  are  summarized  in  Table  1.  They  indicate  that  DSR  is  generally  not  sufficient  to 
solve  realistic  job  shop  scheduling  problems.  Combined  with  ABT,  this  heuristic  was  only  able 
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Performance  of  Five  Heuristics 


RG=0.0 

BK=1 


RG=0.0 

BK=2 


Overall 

Performance 


Search 

Efficiency 


Nb.  exp.  solved 


CPU  seconds 


Search 

Efficiency 


Nh.  exp.  solved 


CPU  seconds 


Search 

Efficiency 


Nh.  exp.  solved 


CPU  seconds 


Search 

Efficiency 


Nh.  exp.  solved 


CPU  seconds 


Search 

Efficiency 


Nb.  exp.  solved 


CPU  seconds 


Search 

Efficiency 


Nb.  exp.  solved 


CPU  seconds 


Search 

Efficiency 


Nb.  exp.  solved 


CPU  seconds 


DSR 

&ABT 

DSR 

&FSS 

ORR 

&ABT 

0.72 

(0.42) 

0.82 

(0.38) 

0.96 

(0.06) 

8 

8 

10 

10.48 

(13.91) 

7.60 

(10.30) 

137 

(0.21) 

0.49 

(0.40) 

0.73 

(0.43) 

034 

(0.39) 

17.73 

(16.38) 


(0.44) 


9.47 

(9.73) 


0.22 

(0.27) 


2 


18.50 

(9.20) 


0J8 

(0.38) 


2 


17.14 

(822) 


031 

(0.33) 


3 


1339 

(10.28) 


0.44 

(0.41) 


29 


9.13 

(9.78) 


0.82 

(0.38) 


8 


532 

(4.98) 


0.46 

(0.46) 


9.66 

(6.48) 


032 

(0.38) 


3 


13.18 

(7.58) 


0.37 

(0.43) 


3 


12.30 

(8.40) 


0.58 

(0.45) 


33 


(0.41) 


40 


14.49  933  11.97 

(11.71)  (8.23)  (13.31) 


SMU 


(0.00) 


10 


1133 

(11.83) 


0.79 

(0.36) 


5.80 

(8.32) 


031 

(0.37) 


1836 

(11.50) 


033 

(0.44) 


16.64 

(16.34) 


0.46 

(0.40) 


5 


18.14 

(16.60) 


ORR 

&FSS 


0.96 

(0.07) 


10 


0.79 

(0.38) 


8 


7.69 

(7.59) 


0.64 

(0.46) 


939 

(7.81) 


0.71 

(0.42) 


7.10 

(6.03) 


0.46 

(0.46) 


0.99 

(0.02) 


10 


1.86 

(0.15) 


0.78 

(0.36) 


8 


6.63 

(10.07) 


037 

(0.29) 


9 


3.68 

(5.62) 


0.73 

(0.43) 


1232  9.50 

(7.99)  (12.81) 


0.75 

(0.41) 


8 


7.67 

(8.30) 


0.72 

(0.41) 


43 


8.00 

(7.13) 


0.82 

(0.38) 


8 


6.01 

(8.88) 


0.86 

(0.31) 


52 


4.91 

(8.07) 


Table  1:  Comparison  of  5  heuristics  over  6  sets  of  10  job  shop  problons. 

Standard  deviations  appear  between  parentheses. 

to  solve  29  problems  out  of  60  in  less  than  500  search  states  each.  Even  when  combined  with 
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the  FSS  value  ordering  heuristic,  DSR  only  achieved  a  search  efficiency  of  58%,  and  failed  to 
solve  27  problems  out  of  60  in  less  than  500  search  states.  These  results  not  only  suggest  that  job 
shop  scheduling  requires  a  dynamic  variable  ordering  heuristic^^.  They  also  indicate  that  the 
variable  ordering  heuristics  proposed  so  far  in  the  CSP  literature  are  often  too  shallow  for 
problems  such  as  job  shop  scheduling.  After  replacing  DSR  with  ORR  in  combination  with 
ABT,  search  efficiency  went  up  by  16%  and  1 1  additional  problems  were  solved  in  less  than  500 
search  states  each.  The  SMU  heuristic  achieved  a  higher  efficiency  of  72%  and  solved  43 
problems  out  of  60  in  less  than  500  states.  Even  this  heuristic  had  trouble  solving  many 
problems.  In  fact,  it  could  hardly  solve  more  problems  than  ORR&ABT.  ORR&FSS,  the 
variable  and  value  ordering  heuristics  advocated  in  this  paper,  yielded  an  impressive  86%  search 
efficiency,  and  solved  52  problems  out  of  60  in  less  than  500  search  states^^.  For  the  52 
experiments  that  it  was  able  to  solve,  ORR&FSS  never  generated  more  than  78  search  states  and 
never  took  over  150  CPU  seconds  to  solve  a  problem.  This  heuristic  combination  also  achieved 
important  speedups  over  all  the  other  heuristics. 

On  problems  with  larger  numbers  of  operations,  the  savings  achieved  by  ORR&FSS  qjpear  to 
become  even  more  important,  although  the  poor  performance  of  the  generic  CSP  heuristics 
precluded  systematic  experimentation  with  such  problems.  At  the  current  time,  ORR&FSS  has 
been  successfully  applied  to  several  himdred  scheduling  problems,  including  a  large  number  of 
problems  with  100  operations,  approximately  300  possible  start  times  per  operation,  and 
botdeneck  loads  close  to  100%  over  the  major  part  of  each  problem.  The  heuristics  have  also 
been  run  on  a  smaller  set  of  problems  with  200  operations.  Backtracking  remained  very  low  on 
most  of  these  problems. 


^^Expeiiments  repotted  in  [38]  were  also  peifonned  to  compare  tbe  perfonnance  of  the  ORR  heuiistic,  as  it  is 
described  here,  widi  less  dynamic  vaiiatiotis  of  this  heuiistic,  in  which  several  critical  operations  were  identified  at 
once.  These  experiments  showed  that  the  peifoimance  of  the  variable  ordering  heuristic  quickly  degrades  as  it 
becomes  less  dynamic. 

^In  a  more  recent  study,  a  combination  of  die  ORR  and  FSS  heuristics  widi  two  new  backtracking  schemes 
solved  an  even  larger  number  of  eiqietiments.  A  selective  dependency-directed  backtracking  scheme  solved  55 
problems  [45]  and  a  backtracking  heuristic  that  undoes  decisions  that  are  not  always  provably  wrong  was  able  to 
solve  all  60  problems. 
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9  Summary  and  Conclusions 

This  pjqjer  studied  a  variation  of  the  job  shop  scheduling  problem  in  which  operations  have  to  be 
performed  within  one  or  several  non-relaxable  time  windows.  Examples  of  such  problems 
include  factory  scheduling  problems  in  which  some  operations  have  to  be  performed  within  one 
or  several  shifts,  spacecraft  mission  scheduling  problems,  some  factory  rescheduling  problems, 
or  any  other  scheduling  problem  with  hard  deadlines.  These  types  of  scheduling  problems  cannot 
be  solved  with  traditional  scheduling  techniques  such  as  priority  dispatch  rules  or  similar  one- 
pass  scheduling  techniques.  Traditional  Mixed  Integer  Programming  techniques  which  could 
potentially  deal  with  these  problems  have  been  overwhelmed  so  far  by  the  combinatorial  number 
of  binary  variables  required  to  account  for  limited  resource  capacities  [27].  Instead,  this  study 
demonstrated  that  many  instances  of  these  problems  can  be  efficiently  solved  by  combining 
consistency  enforcing  techniques  and  look-ahead  techniques  to  decide  which  operation  to 
schedule  next  and  which  reservation  to  assign  to  that  operation. 

Several  variable  and  value  ordering  heuristics  to  guide  the  search  procedure  were  successively 
studied,  including  both  generic  heuristics  which  had  been  reported  to  perform  particularly  well 
on  other  CSPs  and  specialized  heuristics  developed  for  similar  scheduling  problems.  The  review 
indicated  that  generic  CSP  heuristics  are  usually  not  sufficient  to  solve  hard  CSPs  such  as  job 
shop  scheduling.  This  is  because  these  heuristics  fail  to  properly  account  for  the  constraint 
interactions  induced  by  the  high  cormectivity  of  the  cortstraint  graphs  typically  encountered  in 
job  shop  scheduling.  Instead,  a  new  probabilistic  model  of  the  search  space  was  introduced 
which  allows  to  estimate  the  reliance  of  a  variable  (e.g.  operation)  on  the  availability  of  a  value 
(e.g,  reservation),  and  the  degree  of  contention  among  uninstantiated  variables  for  the 
assignment  of  conflicting  values  (e.g.  contention  among  unscheduled  operations  for  the 
allocation  of  a  resource  over  some  time  interval).  Based  on  this  probabilistic  model,  new  variable 
and  value  ordering  heuristics  were  defined: 

1.  The  "Operation  Resource  Reliance"  (ORR)  variable  ordering  heuristic  selects  the 
operation  that  relies  most  on  the  most  contended  resourceAhne  interval,  and 


2.  The  "Filtered  Survivable  Schedules"  (FSS)  value  ordering  heuristic  assigns  to  that 
operation  the  reservation  which  is  expected  to  be  compatible  with  the  largest 
number  of  survivable  job  schedules,  i.e.  job  schedules  that  are  expected  to  survive 
resource  contention. 

Experimental  results  show  that  these  two  heuristics  can  efficiently  solve  many  job  shop 
scheduling  problems  that  could  not  be  efficiently  solved  by  prior  CSP  heuristics  (both  generic 
CSP  heuristics  and  specialized  heuristics  designed  for  similar  scheduling  problems).  The  results 
indicate  that  the  ORR  and  FSS  heuristics  not  only  yield  significant  increases  in  search  efficiency 
but  also  achieve  important  reductions  in  search  time.  Finally,  the  measures  of  value  reliance  and 
contention  presented  in  this  paper  have  also  been  adapted  to  solve  optimization  version  of  the  job 
shop  scheduling  problem  [38, 39]. 
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The  estimates  of  resource  contention  used  in  the  ORR  and  FSS  heuristics  are  based  on  several 
independence  assumptions.  More  sophisticated  versions  of  these  heuristics  have  also  been 
implemented,  which  attempt  to  better  account  for  different  dependencies,  some  using  more 
complex  analytical  models  [33,  34, 37]  others  relying  on  Monte  Carlo  simulations  [35].  The 
increase  in  search  efficiency  generally  achieved  by  these  more  sophisticated  versions  did  not 
seem  to  justify  their  heavier  computational  requirements. 

Because  the  job  shop  scheduling  CSP  is  NP-complete,  no  heuristic  can  be  expected  to  efficiently 
solve  all  instances  of  this  problem.  Nevertheless,  more  powerful  variable  and  value  ordering 
heuristics  may  allow  to  further  reduce  the  number  of  problems  that  cannot  be  solved  efficiently. 
Alternatively,  new  more  powerful  consistency  enforcing  techniques  or  more  sophisticated 
deadend  recovery  schemes  could  also  further  improve  the  efficiency  of  the  search  procedure^*. 

The  heuristics  presented  in  this  paper  are  intended  for  job  shop  scheduling  problems,  i.e. 
scheduling  problems  with  both  precedence  and  capacity  constraints.  The  probabilistic  measures 
of  reliance  and  contention  that  were  described  can  be  used  in  any  resource  allocation  problem, 
and  more  generally  any  CSP  with  disequality  constraints  (i.e.  constraints  preventing  two 
variables  from  being  assigned  the  same  value),  since  these  problems  can  be  formulated  as 
resoiurce  allocation  problems  (e.g.  the  N-queens  problem  can  be  formulated  as  a  resource 
allocation  problem  in  which  each  queenfr’ow  is  a  task  and  each  column  is  a  resource^^). 
However,  the  lessons  learned  from  this  work  go  beyond  job  shop  scheduling  and  resource 
allocation  problems.  Fundamental  weaknesses  of  generic  variable  and  value  ordering  heuristics 
often  praised  in  the  CSP  literature  were  identified.  Variable  ordering  heuristics  like  DSR  count 
the  number  of  values  left  to  each  variable  but  do  not  account  for  the  chances  that  these  values 
remain  available  in  the  future.  Variable  ordering  heuristics  like  MW  or  MC  count  the  number  of 
constraints  incident  to  a  variable  but  do  not  account  for  the  tighmess  of  these  constraints.  Value 
ordering  heuristics  like  ABT  assume  that  the  problem  admits  a  tight  tree-like  relaxation.  The 
probabilistic  model  of  the  search  space  used  to  define  the  ORR  and  FSS  heuristics  allows  to 
overcome  these  weaknesses  by  providing  a  framework  in  which  more  sophisticated 
approximations  of  variable  criticality  and  value  goodness  can  be  defined.  For  instance,  the  ORR 
and  FSS  heuristics  rely  on  efficient  probabilistic  approximations  of  resource  contention.  These 
measures  of  resource  contention  enable  the  scheduler  to  account  for  entire  cliques  of  capacity 
constraints  rather  than  rely  on  advice  based  on  tree-like  relaxations  of  these  cliques.  Last  but  not 
least,  this  work  suggests  that  benchmark  problems  often  used  in  the  CSP  literature  are  not 
representative  of  hard  problems  such  as  job  shop  scheduling.  It  is  hoped  that  this  study  will 
prompt  researchers  in  the  field  to  look  for  new  benchmark  problems  and  new  more  jxrwerful 


^More  recmt  experiments  in  which  the  ORR  and  FSS  heuristics  were  combined  widi  a  heuristic  backtracking 
scheme  that  undoes  decisions  that  are  not  always  provably  wrong  were  in  fact  able  to  efficiently  solve  all  60 
problems  of  ttiis  study.  See  also  [45]  for  experiments  with  another  backtracking  scheme. 

^Constraints  representing  the  ability  of  queens  to  attack  each  other  along  diagonals  can  be  represented  as 
constraints  furtiier  restricting  admissible  resource  assignments. 
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heuristics  for  these  problems. 

The  heuristics  discussed  in  this  paper  all  share  a  common  weakness:  they  always  perform  the 
same  analysis  indqjendently  of  the  problem  that  they  are  presented  with,  and  regardless  of  the 
difficulty  of  the  current  search  state.  A  more  flexible  approach  would  aUow  for  different 
heuristics  to  be  used  according  to  the  difficulty  of  the  problem  and  even  according  to  the 
difficult  of  the  current  search  state.  One  such  mechanism  was  implemented  in  an  earlier 
version  of  the  system  which  relied  on  Monte  Carlo  sampling  to  measure  resource  contention. 
Because  the  system  measured  resource  contention  between  feasible  job  schedules,  it  was 
possible  to  accurately  determine  if  search  had  reached  a  state  in  which  backtracking  could  no 
longer  occur,  like  ORR,  the  variable  ordering  heuristic  implemented  in  that  version  of  the 
system  was  particularly  good  at  quickly  reducing  contention.  As  a  consequence,  search  states 
where  backtracking  could  no  longer  occur  were  generally  reached  after  50%  to  80%  of  the 
operations  had  been  scheduled.  At  that  point,  the  system  would  arbitrarily  complete  the  schedule 
(i.e.  using  arbitrary  variable  and  value  orderings),  which  generally  resulted  in  important 
speedups^®.  A  similar  dynamic  switch  could  also  be  implemented  in  the  current  version  of  the 
system.  Similarly,  different  consistency  enforcing  techniques  could  be  applied  to  different 
problems,  different  search  stales,  or  even  to  different  parts  of  a  same  problem  (e.g.  enforcing 
higher  consistency  levels  with  respect  to  capacity  constraints  at  the  bottlenecks).  Preliminary 
experimentation  with  such  flexible  consistency  enforcing  techniques  has  been  reported  by 
Collinot  and  Lep^  [4]  and  Xiong,  Sadeh,  and  Sycara  [45]. 


^Consistency  enfoioement  was  sdll  canied  out  in  eadi  search  state,  and  was  at  that  point  sufficient  to  guaiamee 
(widiin  the  accuracy  of  the  Monte  Cario  sampling  method)  backtrack-free  search. 
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Appendix  A.  Redundant  Capacity  Constraints 

In  Older  to  rapidly  detect  the  violation  of  capacity  constraints,  it  was  found  useful  to  add  a  set  of 
redundant  cqjacity  constraints  to  the  problem  formulation. 

These  constraints  express  that,  if  two  operations,  Of  and  Oj,  require  the  same  resource  and  are 
constrained  in  such  a  way  that  they  each  totally  rely  on  the  availability  of  some  time  interval  on 
that  resource’s  calendar  (even  though  they  may  still  have  several  possible  start  times  left),  then 
these  two  time  intervals  cannot  overly.  Let  denote  the  p-th  resource  required  by  (7*  and 
the  ^-th  resource  required  by  .  Let  also  csr*,  /jrf  and  du*  respectively  denote  the  earliest 
possible  start  time,  latest  possible  stan  time,  and  duration  of  and  est'j,  Istj  and  du- denote  those 
of  Oj.  The  binary  constraint  between  two  operations,  O-  and  Oj,  can  then  be  formulated  as: 

(ypyqRl*R'j^)  V  (Ist^i^est^i+du.)  v  (Istj^est'j+duj) 

V  (lstj'^esif‘i+du'^)  v  >estj+du‘j) 


Figure  I-l  illustrates  a  simple  simation  where  two  operations  (7*  and  O'  violate  one  such 
constraint.  Both  operations  are  assumed  to  require  the  same  resource,  say 
eftj=estj+duj  is  (9j’s  earliest  possible  finish  time,  and  lfij=lsi^j+dUj  its  latest  possible  finish  time. 
Similarly  is  C7*’s  earliest  possible  finish  time,  and  its  latest  possible  finish  time. 
Whichever  start  time  is  assigned  to  Oj,  Oj  will  need  resource  R  between  Is/j  and  ef^j  (this  would 
not  be  the  case  if  /srj  ^  ^j).  Similarly  C7*  will  need  that  same  resource  between  to*  and  In 
Figure  1*1,  these  two  time  intervals,  namely  [Istj,  ef^Ji  and  [/5f*,  eft%,  overlap.  This  indicates  that 
the  resource  is  oversubscribed;  a  deadend  state  has  been  detected. 


These  types  of  conflicts  can  be  efficiently  avoided  by  maintaining  for  each  resource  a  calendar 
(e.g.  a  bit  vector)  that  records  each  of  the  time  intervals  on  which  an  unscheduled  operation 
totally  relies  (a  similar  but  separate  calendar  is  used  to  keep  track  of  actual  reservations).  As 
soon  as  two  operations  totally  rely  on  overlapping  resourceftime  intervals,  a  deadend  state  is 
detected^  ^ 


^‘Notice  that  this  is  not  equivalent  to  achieving  full  arc-consisteiicy.  Full  arc-consistency  would  require  pnining 
all  Stan  times  diat  have  become  unavailable  due  to  unscheduled  operations  that  totaUy  rely  on  the  availability  of 
some  resourceAime  intervals.  This  would  require  a  more  complex  procedure  in  which  some  operations  may  have  to 
be  inspected  several  times:  as  their  earliest^atest  possible  start  time  intervals  shrink,  new  operations  may  start  to 
totally  rely  on  the  availability  of  some  resourceAime  intervals,  or  operations  that  already  r^ed  totally  on  some 
resourceAime  intervals  may  see  diese  intervals  grow  longer.  This  in  turn  may  affect  the  earliest  atid/or  latest 
possible  Stan  times  of  other  qrerations,  and  so  on.  Although  these  computations  can  still  be  performed  efficiently,  it 
is  not  dear  whether,  on  the  average,  they  would  further  reduce  total  processing  time. 
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\/////y//yA  MrilMt  possible  reservation 
latest  possible  reservation 

{■■■I  Irrterval  of  total  reliance 

Figure  I-l:  A  situation  with  an  oversubscribed  resource  that  can  easily  be 

detected. 
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Appendix  B.  Counting  the  Number  of  Survivable  Schedules 

This  ^>pendix  describes  a  dynamic  programming  procedure  that  efficiently  counts  the  number  of 
survivable  job  schedules  (or  more  generally  the  number  of  survivable  solutions  to  the  relaxation 
defined  in  Section  6.4  for  the  FSS  value  ordering  heuristic)  that  are  conqiatible  with  the 
assignment  of  a  reservation  p  to  the  current  critical  operation  O].  This  number  was  referred  to  as 
compsurv\{t),  where  r  is  the  start  time  allocated  to  0\  in  reservation  p,  The  procedure  presented 
here  is  a  variation  of  a  similar  method  developed  by  Dechter  and  Pearl  for  the  ABT  value 
ordering  heuristic  [6]  (see  also  [31]).  While  a  direct  generalization  of  Dechter  and  Pearl’s 
procedure  would  have  an  OiyilP-)  complexity  (where  is  the  number  of  operations  in  the 
relaxation  used  by  the  FSS  value  ordering  heuristic,  and  k  the  maximum  number  of  possible 
reservations  of  an  operation  in  that  relaxation),  the  procedure  described  here  takes  advantage  of 
the  linearity  of  precedence  constraints  to  reduce  this  complexity  to  0{\jk). 


Figure  II-l:  A  tree-like  process  routing,  organized  with  the  current  critical  operation  as 

its  root.  Arrows  represent  precedence  constraints. 

Figure  II-l  represents  a  prototypical  tree-like  process  routing,  which  has  been  reorganized  with 
the  current  critical  operation  as  the  root  of  the  tree.  The  arrows  represent  precedence  constraints 
between  operations  in  the  process  routing.  The  children  of  the  critical  operation  0\  in  the  tree  are 
those  operations  that  are  directly  connected  to  0\  by  a  precedence  constraint,  the  grandchildren 
the  operations  directly  connected  to  these  operations  by  precedence  constraints,  etc. 

All  the  computations  presented  in  this  appendix  refer  to  a  single  search  state,  in  which 
consistency  checking  has  already  been  performed.  The  notations  are  those  used  in  Section  6.4. 
A  few  extra  notations  need  to  be  defined: 

•  sun/p{t)=^^  g  jjSurv^(p),  where  G  is  the  set  of  remaining  reservations  of  O'^  with 

stp=r. 


•  aj,:  the  direct  children  of  that  are  after  in  the  process  routing; 

•  Pj,:  the  direct  children  of  0|,  that  are  before  Oj,  in  the  process  routing; 

•  A:  the  time  granularity  of  the  problem.  In  Section  6.4,  it  was  assumed  that  A  =  1  (i.e. 
that  all  start  times  and  end  times  have  to  be  integers).  For  the  sake  of  clarity,  the 
foimulas  presented  in  this  appendix  account  explicitly  for  A. 

In  tree-like  process  routings,  each  operation  Oj,  is  the  unique  link  between  otherwise  disjoint  sets 
of  operations,  that  each  correspond  to  one  of  its  children.  Each  of  these  sets  contains  exactly  one 
child  of  operation  O'^  and  defines  a  subproblem  that  only  interacts  with  the  other  subproblems  via 
operation  Accordingly, 

•  For  each  Oj  e  pj,,  we  define  BEF'^j(t)  as  the  number  of  survivable  solutions  to  the 
subproblem  defined  by  operation  Oj  and  its  descendants  that  are  compatible  with  the 
assignment  of  to  O'^; 


•  For  each  0[  €  aj,,  we  define  as  the  number  of  survivable  solutions  to  the 

subproblem  defined  by  operation  and  its  descendants  that  are  compatible  with  the 
assignment  of  jf^=r  to  <3^; 


Given  that  operation  0\  is  the  only  link  between  the  subproblems  defined  by  each  one  of  its 
children,  we  have: 

compsunr\{t)=  JJ  5£'Fj^<r)  x 

je  p|  *€  oj 

Notice  that  this  formula  also  relies  on  an  independence  assumption  made  in  Section  6.4;  the 
probability  that  a  solution  to  the  relaxation  survives  contention  is  assumed  to  be  given  by  the 
product  of  the  probabilities  that  each  one  of  the  reservation  assignments  in  that  solution  survives 
contention. 

BEF^-^p)  is  obtained  by  adding  all  the  subproblem  solutions  compatible  with  the  precedence 
constraint  .rtj+duj  ^  r; 

BEI^,p)=  X  [5urv;(x)x  n  BEF^.p)y^  J]  AFt.p)\ 

XSt-dUj  pj  oj 

Similaiiy  for  AFf-p),  we  have: 
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We  can  speed  up  the  computation  of  this  recuimnce  using  partial  sums: 

BEF^,p)^BEf,p-^)+{sw^]it-du])x  BEF^.p-du')x  AF7^^(r-d«')] 

J  I 

pe  Pj  qe  aj 

AFf^p)=AFf^^(,t+A)+[surv‘p+du‘;)x  Jl  BEP'jit+du^)  x  AFfi^(t+du\)] 

J  I 

56  P*  uea^ 

The  recurrence  is  initialized  with: 

BEF'jpst‘-A)=0 
AFtj,lst[+ A)=Q 

and  uses  the  convention: 

n=i 

0 

In  order  to  compute  compsurv‘.{t)  for  all  remaining  start  times  of  the  critical  operation  0\,  the 
system  starts  by  computing  all  BEF^-p)  or  aU  AFfpt)  at  the  leaf  operations  in  the  tree  depicted 
in  Figure  II-l.  The  procedure  then  moves  up  in  the  tree  by  combining  at  each  level  the  BEF'jp) 
and  AFT'jp)  computed  at  the  previous  level.  At  each  operation  in  the  tree,  the  procedure 
computes  at  most  X  BEF^jp)  expressions  if  is  before  Oj,  its  parent  operation,  or  X  AFfjp) 
expressions,  if  Oj,  is  after  Oj  (where  X,  is  the  maximum  number  of  possible  start  times  of  an 
operation).  Each  such  computation  involves  2  multiplications  and  1  addition.  Hence,  if  is  the 
number  of  operations  in  the  relaxation  used  by  the  FSS  value  ordering  heuristic,  computing  all 
compsurv\(t)  can  be  done  in  0(VjX)  elementary  corrqrutations.  Computing 
survp)=^^  ^  QSurv‘p>)  for  all  the  possible  start  times  of  all  the  operations  in  the  relaxation 
requires  however  0(V;fc)  steps  where  ^  is  the  maximum  number  of  reservations  left  to  an 
operation^^.  Hence  the  overall  complexity  of  the  procedure  is  also  0(v^/t). 


^^Tbe  real  complexity  is  actually  OiVfkdu),  where  du  is  the  duration  of  the  longest  operation  in  die  relaxation. 
This  duration  is  assumed  to  be  bounded  by  a  constant 
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